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Preface 



Jim Byrnes 

The chapters in this volume were presented at the July 2003 NATO 
Advanced Study Institute on Computational Noncommutative Algebra 
and Applications. The conference was held at the beautiful II Ciocco re- 
sort near Lucca, in the glorious Tuscany region of northern Italy. Once 
again we gathered at this idyllic spot to explore and extend the reci- 
procity between mathematics and engineering. The dynamic interaction 
between world-renowned scientists from the usually disparate commu- 
nities of pure mathematicians and applied scientists, which occurred at 
our 1989, 1991, 1992, 1998, and 2000 ASTs, continued at this meeting. 

The fusion of algebra, analysis and geometry, and their application 
to real world problems, have been dominant themes underlying mathe- 
matics for over a century. Geometric algebras, introduced and classified 
by Clifford in the late 19th century, have played a prominent role in 
this effort, as seen in the mathematical work of Cartan, Brauer, Weyl, 
Chevelley, Atiyah, and Bott, and in applications to physics in the work 
of Pauli, Dirac and others. One of the most important applications of 
geometric algebras to geometry is to the representation of groups of Eu- 
clidean and Minkowski rotations. This aspect and its direct relation to 
robotics and vision were discussed by several of the Principal Lecturers, 
and are covered in this book. 

Moreover, group theory, beginning with the work of Burnside, Frobe- 
nius and Schur, has been influenced by even more general problems. As 
a result, general group actions have provided the setting for powerful 
methods within group theory and for the use of groups in applications 
to physics, chemistry, molecular biology, and signal processing. These 
aspects, too, are covered in what follows. 

With the rapidly growing importance of, and ever expanding con- 
ceptual and computational demands on signal and image processing in 
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remote sensing, computer vision, medical image processing, and biologi- 
cal signal processing, and on neural and quantum computing, geometric 
algebras, and computational group harmonic analysis, the topics of the 
following chapters have emerged as key tools. The authors include many 
of the world’s leading experts in the development of new algebraic mod- 
eling and signal representation methodologies, novel Fourier-based and 
geometric transforms, and computational algorithms required for realiz- 
ing the potential of these new application fields. 

The ASI brought together these world leaders from both academia 
and industry, with extensive multidisciplinary backgrounds evidenced by 
their research and participation in numerous workshops and conferences. 
This created an interactive forum for initiating new and intensifying ex- 
isting efforts aimed at creating a unified computational noncommutative 
algebra for advancing the broad range of applications indicated above. 
The forum provided opportunities for young scientists and engineers to 
learn more about these problem areas, and the vital role played by new 
mathematical insights, from the recognized experts in this vital and 
growing area of both pure and applied science. 

The talks and the following chapters were designed to address an au- 
dience consisting of a broad spectrum of scientists, engineers, and math- 
ematicians involved in these fields. Participants had the opportunity to 
interact with those individuals who have been on the forefront of the 
ongoing explosion of work in computational noncommutative algebra, 
to learn firsthand the details and subtleties of this exciting area, and 
to hear these experts discuss in accessible terms their contributions and 
ideas for future research. This volume offers these insights to those who 
were unable to attend. 

The cooperation of many individuals and organizations was required 
in order to make the conference the success that it was. First and 
foremost I wish to thank NATO, and especially Dr. F. Pedrazzini and 
his most able assistant, Ms. Alison Trapp, for the initial grant and 
subsequent help. Financial support was also received from the Air 
Force Office of Scientific Research (Dr. Jon Sjogren), the Defense Ad- 
vanced Research Projects Agency (Dr. Douglas Cochran), the National 
Science Foundation (Dr. Sylvia Wiegand), USAF Rome Laboratories 
(Drs. Michael Wicks, Braham Himed, and Gerard Genello), US Army 
SMDC (Drs. Pete Kirkland and Robert McMillan), FOARD (Dr. Chris 
Reuter), the Raytheon Company (Dr. David N. Martin), Bielefeld Uni- 
versity (Professor Andreas Dress), Bonn University (Professor Michael 
Clausen), Melbourne University, and Prometheus Inc. This additional 
support is gratefully acknowledged. 
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I wish to express my sincere appreciation to my assistants, Marcia 
Byrnes and Gerald Ostheimer, for their invaluable aid. I am also grate- 
ful to Kathryn Hargreaves, our T^^nician, for her superlative work in 
preparing this volume. Finally, my heartfelt thanks to the II Ciocco 
staff, especially Bruno Giannasi and Alberto Suffredini, for offering an 
ideal setting, not to mention the magnificent meals, that promoted the 
productive interaction between the participants of the conference. All of 
the above, the speakers, and the remaining conferees, made it possible 
for our Advanced Study Institute, and this volume, to fulfill the stated 
NATO objectives of disseminating advanced knowledge and fostering 
international scientific contacts. 

September 22, 2003 Jim Byrnes, Newport, Rhode Island 
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Abstract Given a quadratic form on a vector space, the geometric algebra of 
the corresponding pseudo-euclidean space is defined in terms of a sim- 
ple set of rules which characterizes the geometric product of vectors. 
We develop geometric algebra in such a way that it augments, but re- 
mains fully compatible with, the more traditional tools of matrix alge- 
bra. Indeed, matrix multiplication arises naturally from the geometric 
multiplication of vectors by introducing a spectral basis of mutually 
annihiliating idempotents in the geometric algebra. With the help of 
a few more algebraic identities, and given the proper geometric inter- 
pretation, the geometric algebra can be applied to the study of affine, 
projective, conformal and other geometries. The advantage of geometric 
algebra is that it provides a single algebraic framework with a compre- 
hensive, but flexible, geometric interpretation. For example, the affine 
plane of rays is obtained from the euclidean plane of points by adding 
a single anti-commuting vector to the underlying vector space. The key 
to the study of noneuclidean geometries is the definition of the oper- 
ations of meet and join, in terms of which incidence relationships are 
expressed. The horosphere provides a homogeneous model of euclidean 
space, and is obtained by adding a second anti-commuting vector to the 
underlying vector space of the affine plane. Linear orthogonal trans- 
formations on the higher dimensional vector space correspond to con- 
formal or Mobius transformations on the horosphere. The horosphere 
was first constructed by F.A. Wachter (1792-1817), but has only re- 
cently attracted attention by offering a host of new computational tools 
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in projective and hyperbolic geometries when formulated in terms of 
geometric algebra. 

Keywords: affine geometry, Clifford algebra, conformal geometry, conformal group, 
euclidean geometry, geometric algebra, horosphere, Mobius transforma- 
tion, non-euclidean geometry, projective geometry, spectral decomposi- 
tion. 

1. Geometric algebra 

A Geometric algebra is generated by taking linear combinations of 
geometric products of vectors in a vector space taken together with a 
specified bilinear form. Here we shall study the geometric algebras of 
the pseudo- euclidean vector spaces Gp^g := Qp^g{lRF'‘^) for which we have 
the indefinite metric 

p p+q 

X • y = ^ XiVi - ^ XjVj 

2=1 Jf=p+1 

for X = (xi • • • Xp+g) and y = [yi • • • Vp+q) in We first 

study the geometric algebra of the more familiar Euclidean space. 

1.1 Geometric algebra of Euclidean n-space 

We begin by introducing the geometric algebra Qn := Q[1R^) of the 
familiar Euclidean n-space 

iR” = {x| X = (xi • • • Xn) for Xi G M}. 

Recall the dual interpretations of each element x G iR”, both as a point 
of iR” with the coordinates (xi • • • x^) and as the position vector or 

directed line segment from the origin to the point. We can thus express 
each vector x G iR” as a linear combination of the standard orthonormal 
basis vectors {ei,e2,-'' where e* = (O ••• 0 0 ••• O), 

namely 

n 

X = y^xjCj. 

2=1 

The vectors of are added and multiplied by scalars in the usual 
way, and the positive definite inner product of the vectors x and y = 
(yi • • • 2/n) is given by 

n 

x-y = 

2=1 



( 1 ) 
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The geometric algebra Qn is generated by the geometric multiplication 
and addition of vectors in In order to efficiently introduce the geo- 
metric product of vectors, we note that the resulting geometric algebra 
Qn is isomorphic to an appropriate matrix algebra under addition and 
geometric multiplication. Thus, like matrix algebra, Qn is an associative, 
but non-commutative algebra, but unlike matrix algebra the elements of 
Qn are assigned a comprehensive geometric interpretation. The two fun- 
damental rules governing geometric multiplication and its interpretation 
are: 

■ For each vector x G iR”, 

n 

= XX = \x\^ = ^ xj ( 2 ) 

i=l 

where \x\ is the usual Euclidean norm of the vector x. 

■ If oi, 02, . . . , Ofc G iR” are k mutually orthogonal vectors, then the 
product 

Ak = ai02 ■ ■ - Ok ( 3 ) 

is totally antisymmetric and has the geometric interpretation of a 
simple k -vector OT a directed k-plane 

Let us explore some of the many consequences of these two basic rules. 
Applying the first rule ( 2 ) to the sum o -|- 6 of the vectors a, 6 G we 
get 

(a -|- 6)^ = ab ba 6^, 
or 

a ■ b := 2 ~ 

which is a statement of the famous law of cosines . In the special case 
when the vectors a and b are orthogonal, and therefore anticommutative 
by the second rule ( 3 ), we have ab = —ba and a • 6 = 0 . 

If we multiply the orthonormal basis vectors ei2 := eiC2, we get the 2 - 
vector or bivector ei2, pictured as the directed plane segment in Figure 1 . 
Note that the orientation of the bivector ei2 is counterclockwise, and 
that the bivector 621 := 6261 = —6162 = — ei2 has the opposite or 
clockwise orientation. 



^This means that the product changes its sign under the interchange of any two of the 
orthogonal vectors in its argument. 
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Figure 1. The directed plane segment ei2 = 6162. 

We can now write down an orthonormal basis for the geometric alge- 
bra Qn, generated by the orthonormal basis vectors {ej| 1 < i <n}. In 
terms of the modified cartesian-like product, x”^j^(l,ei) := 

{1, Cl , , Cfj, Ci2, • • • , 6(n— l)n) ■> ■ ■ ■ : 

There are 




linearly independent elements in the standard orthonormal basis of Qn- 
Any multivector or geometric number g £ Qn can be expressed as a sum 
of its homogeneous A:- vector parts, 

9 = 9i) + "' + 9k + "' + 9n 

where g^ ■=< g >k= Yla “0-60- where a = a\ ■■■ for 1 < ui < • • • < 
(Tfc < n, and G IR. The real part go :=< g > 0 = ctoco = cto of the 
geometric number g is just a real number, since cq := 1. By definition, 
any /c-vector can be written as a linear combination of simple A:-vectors 
or k-blades , [8, p.4]. 

Given two vectors a,b G IR”, we can decompose the vector a into 
components parallel and perpendicular to 6, a = oy -|- a±, where 

ail = (a • 6)-^ = (a • b)b~^, 

II ^|^|2 

and a_L := a — ay, see Figure 2. 

With the help of (3), we now calculate the geometric product of the 
vectors a and b, getting 
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Figure 2. Decomposition of a into parallel and perpendicular parts. 





Figure 3. The bivectors af\b and bf\a. 

where the outer product af\b := ^{ah — ha) = aj_6 = — 6aj_ = —hf\a 
is the bivector shown in Figure 3. The basic formula (4) shows that 
the geometric product ab is the sum of a scalar and a bivector part 
which characterizes the relative directions of a and b. If we make the 
assumption that a and b lie in the plane of the bivector ei 2 , then we can 
write 

ab = |a||6|(cos ip + I sin (p) = laUftle^*^, (5) 

where I := ei 2 = eiC 2 has the familiar property that 

= 61626162 = -61626261 = -efel = - 1 . 

Equation (5) is the Euler formula for the geometric multiplication of 
vectors. 

The definition of the inner product a ■ b and outer product aAb can 
be easily extended to a- and aABr, respectively, where r > 0 denotes 
the grade of the r- vector Bj.'- 

Definition 2 The inner product or contraction a- Br of a vector a with 
an r-vector Br is determined by 

a-Br = ^{aBr + (-l)^+^5^a) = {-l)''+^Br ■ a. 

Definition 3 The outer product aABr of a vector a with an r-vector 
Br is determined by 

aABr = ^{aBr - Aa. 
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Note that a ■ (3 = fi ■ a = 0 and oA/3 = flAa = f 3 a for the scalar 
f 3 G M. Indeed, we will soon show that a • B^. =< aBj. >r-i and 
aABr =< aBj. >r+i for all r > 1; we have already seen that this is true 
when r = 1. There are different conventions regarding the use of the dot 
product and contraction [5, p. 35]. 

One of the most basic geometric algebras is the geometric algebra 
Qs of 3 dimensional Euclidean space which we live in. The complete 
standard orthonormal basis of this geometric algebra is 

Gs = Xi=i(l,ej) = span{l, 61 , 62 , 63, 612,613,623, ei23}- 

Any geometric number g £ Gs has the form g = a + vi + iv 2 + Pi where 
i ■= ei 23 . Notice that we have expressed the bivector part of g as the 
dualoi the vector V 2 - Thus the geometric number g = {a+iP)-\-{v\+iv2) 
can be expressed as the sum of its complex scalar part (a + iP) and a 
complex vector part (ui + iv2)- Note that the complex scalar part has 
all the properties of an ordinary complex number z = x + iy. This 
follows easily from the fact that the pseudoscalar i = 6123 satisfies P = 
61236123 = 623623 = — 1- 

We can use the Euler form (5) to see that 

a = a{b~^b) = {ab~^)b = (^)b, 

so the geometric quantity ab/\b\‘^ rotates and dilates the vector b into 
the vector a when multiplied by b on the left. Similarly, multiplying b on 
right by also rotates and dilates the vector b into the vector a. By re- 
expressing this result in terms of the Euler angle cp, letting I = ie^, and 
assuming that joj = |6|, we can write a = exp{ie3ip)b = bexp{—ie3(p). 
Even more powerfully, and more generally, we can write 

a = exp(/(/?/2)6exp(— /(/?/2), 

which expresses the ^-angle formula for rotating the vector b £ in 
the plane of the simple bivector I through the angle (p. There are many 
more formulas for expressing reflexions and rotations in iR”, or in the 
pseudo-euclidean spaces [8], [10]. 

3.1 Basic algebraic identities 

One of the most difficult aspects of learning geometric algebra is com- 
ing to terms with a host of unfamiliar algebraic identities. These im- 
portant identities can be quickly mastered if they are established in a 
careful systematic way. The most important of these identities follows 
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easily from the following two trivial algebraic identities involving the 
vectors a and b and an r-blade where r > 0: 

obBr + hB^a = {ab + ba)Br — b{aBr — B^a), (6) 

and 

baBr — aBrb = {ba + ab)Br — a{bBr + Brb). (7) 

Whereas these identities are valid for general r- vectors, we state them 
here only for a simple r- vector Br, the more general case following by 
linear superposition. 

In proving the identities below, we use the fact that 

bBr =< bBr >r-l + < bBr >r+l • (8) 

This is easily seen to be true if B^ is an r-blade, in which case Br = 
b\ - ■ - hr for r orthogonal, and therefore anticommuting, vectors b\, ... ,br- 
We then simply decompose the vector 6 = 6|| -|- b± into parts parallel and 
perpendicular to the subspace of iR” spanned by the vectors b\, ... ,br, 
and use the anticommutivity of the b's to show that b • Br = b^^Br = 
< >r-i and bABr = b±Br =< b±Br >r+i- This also shows the 

useful result that b ■ Br and bABr are blades whenever Br is a blade. 
The following basic identity relates the inner and outer products: 

a ■ (bABr) = (a • b)Br — bA{a ■ Br), (9) 

for all r > 0. If r = 0, (9) follows from what has already been established. 
If r > 2 and even, (6) and definition (2) implies that 

2a • (bBr) = 2(a • b)Br — 2b(a ■ Br). 

Taking the r- vector part of this equation gives (9). If r > 1 and odd, 
(7) implies that 



2b • (aBr) = 2(a • b)Br — 2a(b • Br) 

which implies (9) by again taking the r-vector part of this equation 
and simplifying. By iterating (9), we get the important identity for 
contraction 



n 

a ■ (biA ■ ■ ■ Abn) = ^(-l)*+^(a -bi) 6iA • • • i • • • A6„. 
i=l 

Let I = ei 2 ...n be the unit pseudoscalar element of the geometric 
algebra Qp^q = Q{1RP’‘^). We give here a number of important identities 
relating the inner and outer products which will be used later in the 




COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



contexts of projective geometry. For an r-blade Ar, the {p + q — r)- 
blade A* := Arl~^ is called the dual of Ar in Gp^g with respect to the 
pseudoscalar I. Note that it follows that I* = 1 = 1 and 1 * = 

For r + s < p + we have the important identity 

{ArCBsT = {ArCBs)!-^ = Ar-{BsI-^) = Ar ■ B*, = 

(10) 

3.2 Geometric algebras of psuedoeuclidean 
spaces 

All of the algebraic identities discussed for the geometric algebra Qn 
hold in the geometric algebra with indefinite signature Qp^q- However, 
some care must be taken with respect to the existence of non-zero null 
vectors . A non-zero vector n G is said to be a null vector if 

= n-n = 0 . The inverse of a non-null vector u is so clearly a 

null vector has no inverse. The spacetime algebra of also called 
the Dirac algebra, has many applications in the study of the Lorentz 
transformations used in the special theory of relativity. Whereas non- 
zero null vectors do not exist in IR'^, there are many non-zero geometric 
numbers g G Gn which are null. For example, let g = ci A- e±2 G Gs, then 

= (ei + ei2)(ei -h 612) = ef -h Ci2 = 1 - 1 = 0. 



Let us consider in more detail the spacetime algebra Gi,3 of The 

standard orthonormal basis of are the vectors {ei, 62, 63, 64}, where 
ef = 62 = 63 = — 1 = —64. The standard basis of the bivectors ^43 of 
^1^3 are 644, 624, 634, iei4, ic24, ^634, where i = 64234 is the pseudoscalar of 
^4^3. Note that the first 3 of these bivectors have square -|- 1 , where as the 
duals of these basis bivectors have square — 1 . Indeed, the subalgebra of 
^4^3 generated by £'4 = 644, £2 = 642, £3 = 643 is algebraically isomor- 
phic to the geometric algebra Gs of space. This key relationship makes 
possible the efficient expression of electromagnetism and the theory of 
special relativity in one and the same formalisms. 

An important class of pseudo-euclidean spaces consists of those that 
have neutral signature, Gn,n = Gn,n{dR^’^)- The simplest such algebra 
is Gi,i with the standard basis {1,64,62,642}, where ef = 1 = —62 and 
6^2 = 1 - We shall shortly see that Gi,i is the basic building block for 
extending the applications of geometric algebra to affine and projective 
and other non-euclidean geometries, and for exploring the structure of 
geometric algebras in terms of matrices. 
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3.3 Spectral basis and matrices of geometric 
algebras 

Until now we have only discussed the standard basis of a geometric 
algebra Q. The standard basis is very useful for presenting the basic rules 
of the algebra and its geometric interpretation as a graded algebra of 
multivectors of different grades, a /c-blade characterizing the direction of 
a /c-dimensional subspace. There is another basis for a geometric algebra, 
called a spectral basis , that is very useful for relating the structure of 
a geometric algebra to corresponding isomorphic matrix algebras [15]. 
Another term that has been applied is spinor basis , but I prefer the 
term “spectral basis” because of its deep roots in linear algebra [17]. 

The key to constructing a spectral basis for any geometric algebra Q 
is to pick out any two elements u,v G G such that = 1, v~^ exists, 
and uv = —vu / 0. We then define the idempotents = ^(1 + u) and 
U- = ^(1 — u) in G, and note that 

= u-,u^u- = U-U+ = 0, and = 1. 

We say that u± are mutually annihiliating idempotents which partition 
1. Also = u_u, from which it follows that = U-V~^ . 

Using these simple algebraic properties, we can now factor out a 2 x 2 
matrix algebra from G- Adopting matrix notation, first note that 

(1 v) u+ = (u+ VU+) =u+ + u- = l. 

For any element g G G, have 

g= [l v) u+ 9 (l v) u+ 

= {i ”)“+(„-% ^ 

The expression [g] := (yV~^g v~^g^ called the matrix de- 

composition of g with respect to the elements {u, u} C G- To see that 
the mapping g [g] gives a matrix isomorphism^ in the sense that 
[g + h] = [(/] + [h] and [gh] = [g][h] for all g,h G G, it is obvious that we 
only need to check the multiplicative property. We find that 

\m = 
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_ ( guj^h ^ gvuj^v gu+hv + gvu+v ^hv \ 

\v~^gu+h + v~^gvu+v~^h v~^gu+hv + v~^gvu+v~^hv J 

= “+(v-th /vto)“+ = l»''l- 

To fully understand the nature of this matrix isomorphism, we need 
to know about the nature of the entries of [^f] in (11). We will analyse 
the special case where G JR, although the relationship is valid for 
more general v. In this case, the entries of [^f] can be decomposed in 
terms of conjugations with respect to the elements u and v. Let a G G 
for which a~^ exists. The a- conjugate of the element g G Q is defined 

by = aga~^. 

We shall use u- and u-conjugates to decompose any element g into the 
form 

g = G\ + UG2 + v{G^ + UG4) (f 2 ) 

where Gi G Cg{{u,v}), the subalgebra of all elements of Q which com- 
mute with the subalgebra generated by {u,v}. It follows that Q = 
Gg{{u,v}) ( 8 > {ri, u} or ^ = M. 2 {Cg{{u,v})). This means that Q is iso- 
morphic to a 2 X 2 matrix algebra over Gg{{u,v}). 

We first decompose g into the form 

9 =^{9 + T) + ^[^(5 “ 9^)] =91+ vg2 

where gi,g2 G Cg{u), the geometric subalgebra of Q of all elements which 
commute with the element u. By further decomposing gi and g2 with 
respect to the u-conjugate, we obtain the decomposition 

g = Gi + UG2 + v{G^ -\- uG^) 
where each Gi G Gg{{u,v}). Specifically, we have 

Gi = ^(51 + 9i) = ^(5 + U9U + vgv~^ -h vuguv~'^) 

G2 = 1(51 - 5 i) = ^(5 + ugu - vgv~'^ - vuguv~^) 

X X) ^ 

G3 = -(52 + gl) = —{9 - ugu + vgv~^ - vuguv~^) 

^ u, UV~^ , _1 

Ga = -{92 - 92) = ^^{9 - ugu - vgv + vuguv ). 

Using the decomposition (12) of 5 , we find the 2x2 matrix decom- 
position ( 11 ) of g over the module Cg{{u,v}), 
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where G* G Cg{{u,v}) for 1 < i < 4. 

For example, for g G Gs and u = ei, v = eu = ~v~^, we write 
g = (zi + UZ2) + v{z3 + UZ4), where Zj = Xj + iyj for i = 6123 and 
1 < J < 4. Noting that u±u = uu± = and u±v = vuzp, and 
substituting this complex form of g into the above equation gives 



We say that 



= (1 v) u+ ^ 

[9] = u+ 



( ^ 

\v ^g 


gv \ 
v~^gv J 


zi + 2:2 


1 

CO 


24 + 23 


Zl - Z2 


f zi A- 


Z2 Z4 - 



1 



1 

,-i 



Z4 + Z3 - Z2 



is the matrix decomposition of g G G3 over the complex numbers, G = 
{x + iy} where i = 6123. It follows that 



Gs = M2{C). 

There are many decompositions of Clifford geometric algebras into 
isomorphic matrix algebras. As shown in the example above, a matrix 
decomposition of geometric algebra is equivalent to selecting a spectral 
basis, in this case 

C"-). 

as opposed to the standard basis for the algebra. The relative position 
of the elements in the spectral basis, written as a matrix above, gives 
the isomorphism between the geometric algebra and the matrix algebra. 

There is a matrix decomposition of the geometric algebra Gp+i,q+i 
that is very useful. For this decomposition we let u = ep+iCp+q+2, so 
that the bivector u has the property that = 1 , and let v = e^+i. We 
then have the idempotents u± = ^(1 ± u), satisfying vu± = UzfV, and 
giving the decomposition 



Gp+l,q+l — Gl,l < 8 * Gp,q — ■M-2{Gp,q) ( 14 ) 

for Gi,i — gsn{ep4-i , ep-|_q-|_2} and Gp,q — gGn{c\ , ,Cp, ep_|_2, . . . , ^p+g+i }• 

4. Projective Geometries 

Leonardo da Vinci (1452-1519) was one of the first to consider the 
problems of projective geometry. However, projective geometry was not 
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formally developed until the work “Traite des propries projectives des 
figure” of the French mathematician Poncelet (1788-1867), published in 
1822. The extrordinary generality and simplicity of projective geometry 
led the English mathematician Cayley to exclaim: “Projective Geometry 
is all of geometry” [18]. 

The projective plane is almost identical to the Euclidean plane, ex- 
cept for the addition of ideal points and an ideal line at infinity. It seems 
natural, therefore, that in the study of analytic projective geometry the 
coordinate systems of Euclidean plane geometry should be almost suf- 
ficient. It is also required that these ideal objects at infinity should be 
indistinquishable from their corresponding ordinary objects, in this case 
ordinary points and ordinary lines. The solution to this problem is the 
introduction of “homogeneous coordinates”, [6, p. 71]. The introduc- 
tion of the tools of homogeneous coordinates is accomplished in a very 
efficient way using geometric algebra [9] . While the definition of geomet- 
ric algebra does indeed involve a metric, that fact in no way prevents 
it from being used as a powerful tool to solve the metric-free results 
of projective geometry. Indeed, once the objects of projective geometry 
are identified with the corresponding objects of linear algebra, the whole 
of the machinery of geometric algebra applied to linear algebra can be 
carried over to projective geometry. 

Let be an (n -|- l)-dimensional euclidean space and let Gn+i 

be the corresponding geometric algebra. The directions or rays of non- 
zero vectors in are identified with the points of the n-dimensional 

projective plane n”. More precisely, we write 

jjn ^ 

where M* = M — {0}. We thus identify points, lines, planes, and higher 
dimensional k-planes in II”' with 1, 2, 3, and {k + l)-dimensional sub- 
spaces of where k < n. To effectively apply the tools of 

geometric algebra, we need to introduce the basic operations of meet 
and join. 

4.1 The Meet and Join Operations 

The meet and join operations of projective geometry are most easily 
defined in terms of the intersection and union of the linear subspaces 
which name the objects in If”. Each r-dimensional subspace A'^ is de- 
scribed by a non-zero r-blade Ar G We say that an r-blade 

Ar A ^ represents, or is a representant of an r-subspace A^ of if 

and only if 



X = {xGIR”+^1 xCAr = t)]. 



(15) 
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The equivalence class of all nonzero r-blades G which define 

the subspace is denoted by 

{Ar}ray ■= {tAr \ t ^ M, t / 0}. (16) 

Evidently, every r-blade in {Ar}ray is a representant of the subspace 
A^ . With these definitions, the problem of finding the meet and join is 
reduced to the problem of finding the corresponding meet and join of 
the (r + 1)- and (s + l)-blades in the geometric algebra which 

represent these subspaces. 

Let ylr, Bg and Ct be non-zero blades representing the three subspaces 
A^, and C*, respectively. Following [15], we say that 

Definition 5 The t-hlade Ct = A^ C\ Bg is the meet of Ar and Bg 
if there exists a eomplementary (r — t) -blade Ac and a eomplementary 
(s — f)-blade Be with the property that Ar = Acf\Ct, Bg = CtABc, and 
AcABc a 0- 

It is important to note that the t-blade Ct G {Ct}ray is not unique 
and is defined only up to a non-zero scalar factor, which we choose at 
our own convenience. The existence of the t-blade Ct (and the corre- 
sponding complementary blades Ac and Be) is an expression of the basic 
relationships that exists between linear subspaces. 

Definition 6 The {r s — t)-blade D = Arid Bg, ealled the join of Ar 
and Bg is defined by D = Ar U Bg = ArABc- 

Alternatively, since the join Ar U Bg is defined only up to a non-zero 
scalar factor, we could equally well define Dhy D = AcABg. We use the 
symbols H interseetion and U union from set theory to mark this unusual 
state of affairs. The problem of “meet” and “join” has thus been solved 
by finding the direct sum and intersection of linear subspaces and their 
(r -|- s — t)-blade and t-blade representants. 

Note that it is only in the special case when ArCiBg = 0 that the join 
can be considered to reduee to the outer product. That is 

ArCBg = t) ArCBg = ArABg. 

In any case, once the join J := ArU Bg has been found, it can be used 
to find the meet 

ArHBg = Ar- [Bg • J] = [JJAr] ■ [Bg • J] = [{Ar • J)A{Bg ■ J)] ■ J (17) 

In the case that J = /~^, we can express this last relationship in terms 
of the operation of duality defined in (10), Ar D Bg = {A*AB*)* = 
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{A*CB*)* which is DeMorgan’s formula. It must always be remembered 
that the “equalities” in these formulas only mean “up to a non-zero real 
number” . While the positive definite metric of is irrelevant to the 

definition of the meet and join of subspaces, the formula (17) holds only 
in 

A slightly modified version of this formula will hold in any non- 
degenerate pseudo-euclidean space and its corresponding geometric 
algebra Gp^q := where p + q = n + 1. In this case, after we have 

found the join J = Bg, we first find any blade J of the same step 
which satisfies the property that J ■ J = 1. The blade J is called a 
reciprocal blade of the blade J in the geometric algebra Gp,q- The meet 
Aj. n Bg may then be defined by 

ArCBg = Ar-iBg-l] = [{Ar-l)-J]-[Bg-l] = {[{Ar - 1 ) C{B g ■!)]}■ J (18) 

The meet and join operations formulated in geometric algebra can 
be used to efficiently prove the many famous theorems of projective 
geometry [9]. See also Geometric- Affine- Projective Computing at the 
website [ 12 ], 

6.1 Incidence, Projectivity and Colineation 

Let J G Gp'^^ be a (/c-|-l)-blade representing a projective /c-dimensional 
subplane in 11"' where k < n. A point (ray) x G II" is said to be incident 
to J if and only if xA J = 0. Since J is a (A: -|- l)-blade, we can find 
vectors oi, . . . , a^+i G such that J = aiA • • • Aa^+i. Projectively 
speaking, this means we can find k -\-l non-co(/c — l)planar points a* in 
the k-projective plane J. 

Now let J be a reciprocal blade to J with the property that J ■ J = 1. 
With the help of J, we can define a determinant function or bracket 
[• • • ] J on the projective A:-plane J. Let 6 i, . . . , 6^+1 be {k -|- 1) points 
incident to J, 

[&i, • • • ) ^fc+i]j := (^ 1 ^ • • • A 6 fc+i) • J . (19) 

The bracket [ 6 i, • • • , / 0 iff the points bi are not co-{k — l)planar. 

We now give the definitions necessary to complete the translation 
of real projective geometry into the language of multilinear algebra as 
formulated in geometric algebra. 

Definition 7 A central perspectivity is a transformation of the points 
of a line onto the points of a line for which each pair of corresponding 
points is collinear with a fixed point called the center of perspectivity. 
See Figure 4- 
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The key idea in the analytic expression of projective geometry in ge- 
ometric algebra is that to each projectivity in II” there corresponds 
a non-singular linear transformation^ T : IRP’^ — > It is clear 

that each projectivity of points on II” induces a corresponding projec- 
tive collineation of lines, of planes, and higher dimensional projective 
/c-planes. The corresponding extension of the linear transformation T 
from to the whole geometric algebra Qp^q which accomplishes this 
is called the outermorphism T : Qp^q — > Gp,q: which is defined in terms 
of T by the properties: 

T(l) := 1, T{x)=T{x), T(a;iA • • • AXfc) := T(xi)A • • • AT(xfc) (20) 

for each 2 < k < p q, and then extended linearly to all elements of 
Gp^q- Outermorphisms in geometric algebra, first studied in [16], provide 
the backbone for the application of geometric algebra to linear algebra. 
Since in everything that follows we will be using the outermorphism T 
defined by T, we will drop the boldface notation and simply use the 
same symbol T for both the linear transformation and its extension to 
an outermorphism T. 



0 




Figure 4- A central perspectivity from the point o. 



^Unique up to a non-zero scalar factor. 
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Definition 8 A projective transformation or projectivity is a transfor- 
mation of points of a line onto the points of a line which may be expressed 
as a finite product of central perspectivities. 

We can now easily prove 

Theorem 9 There is a one-one correspondence between non-singular 
outermorphisms T : Qp^q — > Qp,q! projective collineations on H"' 
taking n + 1 non-co{n — l)planar points in II"' into n + 1 non-co(n- 
1) planar points in II". 

Proof: Let oi, . . . , On+i G II" be n + 1 non-co(n-l)planar points. Since 
they are non-co(n-l)planar, it follows that oiA • • • Aa^+i / 0. Suppose 
that bi = T{ai) is a projective transformation between these points for 
1 < i < n + 1. The corresponding non-singular outermorphism is de- 
fined by considering T to be a linear transformation on the basis vectors 
ai,...,an+i of Conversely, if a non-singular outermorphism is 

specified on it clearly defines a unique projective collineation on 
n", which we denote by the same symbol T. 

All of the theorems on harmonic points and cross ratios of points on 
a projective line follow easily from the above definitions and properties 
[9], but we will not prove them here. For what follows, we will need two 
more definitions: 

Definition 10 A nonidentity projectivity of a line onto itself is ellip- 
tic, parabolic, or hyperbolic as it has no, one, or two fixed points, re- 
spectively. More generally, we will say that a nonidentity projectivity of 
n" is elliptic, parabolic, or hyperbolic, if whenever it fixes a line in II", 
then the restriction to each such line is elliptic, parabolic, or hyperbolic, 
respectively. 

Let a, 6 G II" be distinct points so that a/\b / 0. If T is a projectivity 
of II" and T(oA 6 ) = AoA 6 for A G IR* , then the characteristic equation 
of T restricted to the subspace oA 6 , 

[(A — T)(oA 6 )] • aA 6 = 0 (21) 

will have 0, 1 or 2 real roots, according to whether T has 0, 1 or 2 real 
eigenvectors, [15], [ 8 , p.73], which correspond directly to fixed points. 

Definition 11 A nonidentity projective transformation T of a line onto 
itself is an involution ifT‘^ = identity. 

11.1 Conics and Polars 

Let oi, 02 , • . . , o„+i G represent n -|- 1 = p + q linearly indepen- 
dent vectors in This means that I = aiAo 2 A • • • AOn+i 7 ^ 0. As an 
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element in the projective space II”, I represents the projective n-plane 
determined by the n + 1 non-co(n — Ijplanar points ai, . . . , On+i- Rep- 
resenting the points of II” by homogeneous vectors x G makes it 

easy to study the quadric hypersurface (conic) Q in II” defined by 

Q := {x\ X G IRF^’^^x / 0, and x"^ = 0}. (22) 

Definition 12 The polar of the k-blade A G Qp q is the (n + l — k) -blade 
PoIq{A) defined by 

PoIq{A) := AI~^ = yl* (23) 

where A* is the dual of A in the geometric algebra Gp,q- 

The above definition shows that polarization in the quadric hypersurface 
Q and dualization in the geometric algebra Qp^q are identical operations. 
If X G Q, it follows that 

x/\PoIq{x) = x/\{xl~^) = x‘^I~^ = 0. (24) 

This tells us that Polgfx) is the hyperplane which is tangent to Q at 
the point x, [12]. We will meet something very similar when we discuss 
the horosphere in section 5. 

13. Affine and other geometries 

In this section, we explore in what sense “projective geometry is all 
of geometry” as exclaimed by Cayley. In order to keep the discussion as 
simple as possible, we will discuss the relationship of the 2 dimensional 
projective plane to other 2 dimensional planar geometries, [6]. 

We begin with affine geometry of the plane. Let 11^ be the real pro- 
jective plane, and T(II^) the group of all projective transformations on 
n^. Let a line C G (a degenerate conic) be picked out as the absolute 
line, or the line at infinity. 

Definition 14 The affine plane A? consists of all points of the projec- 
tive plane with the points on the absolute line deleted. The projective 
transformations leaving C fixed, restricted to the real affine plane, are 
real affine transformations. The study of A‘^ and the subgroup of real 
affine transformations T{A^} is real plane affine geometry. 

If we now fix an elliptic involution, called the absolute involution, on 
the line C, then a real affine transformation which leaves the absolute 
involution invariant is called a similarity transformation. 

Definition 15 The study of the group of similarity transformations on 
the affine plane is similarity geometry. 
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An affine transformation which leaves the area of all triangles invariant 
is called an equiareal transformation. 

Definition 16 The study of the affine plane under equiareal transfor- 
mations is equiareal geometry. 

A euelidean transformation on the affine plane is an affine transforma- 
tion which is both a similarity and an equiareal transformation. Finally, 
we have 

Definition 17 The study of the affine plane under euelidean transfor- 
mations is euelidean geometry. 

In our representation of II"' in a geometric algebra Qp^q where n -|- 1 = 
p-\- q, a projectivity is represented by a non-singular linear transforma- 
tion. Thus, the group of projectivities becomes just the general linear 
group of all non-singular transformations on extended to outermor- 
phisms on Qp^q. Generally, we may choose to work in a euclidean space 
rather than in the pseudo-euclidean space [9], [12]. We have 
chosen here to work in the more general geometric algebra Qp,q^ because 
of the more direct connection to the study of a particular nondegenerate 
quadric hypersurface or conic in IRP'^. 

We still have not mentioned two important classes of non-euclidean 
plane geometries, hyperholie plane geometry and elliptie plane geometry, 

, and their relation to projective geometry. Unlike euclidean geometry, 
where we picked out a degenerate conic called the absolute line or line 
at infinity, the study of these other types of plane geometries involves 
the picking out of a nondegenerate conic called the absolute eonie . 

For plane hyperbolic geometry, we pick out a real non-degenerate 
conic in 11^, called the absolute eonie, and define the points interior to 
the absolute to be ordinary , those points on the absolute are ideal , and 
those points exterior to the absolute are ultraideal [6, p.230j. 

Definition 18 A real projeetive plane from whieh the absolute eonie 
and its exterior have been deleted is a hyperbolie plane. The projee- 
tive eollineations leaving the absolute fixed and earrying interior points 
onto interior points, restrieted to the hyperbolie plane, are hyperbolie 
isometries. The study of the hyperboie plane and hyperbolie isometries 
is hyperbolie geometry. 

Hyperbolic geometry has been studied extensively in geometric algebra 
by Hongbo Li in [3, p. 61-85], and applied to automatic theorem proving 
in [3, p.110-119], [5, p.69-90]. 

For plane elliptic geometry, we pick out an imaginary nondegenerate 
conic in H2 as the absolute eonie. Since there are no real points on 
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this conic, the points of elliptic geometry are the same as the points in 
the real projective plane 11^. A projective collineation which leaves the 
absolute conic fixed (whose points are in the complex projective plane) 
is called an elliptic isometry . 

Definition 19 The real projective plane is the elliptic plane. The 
study of the elliptic plane and elliptic isometries is elliptic geometry. 

In the next section, we return to the study of affine geometries of 
higher dimensional pseudo-euclidean spaces. However, we shall not 
study the formal properties of these spaces. Rather, our objective is to 
efficiently define the horosphere of a pseudo-euclidean space, and study 
some of its properties. 

20. Affine Geometry of pseudo-euclidean space 

We have seen that a projective space can be considered to be an affine 
space with idealized points at infinity [18] . Since all the formulas for meet 
and join remain valid in the pseudo-euclidean space using (18), we 

define the n = (p-l-g)-dimensional affine plane Ae{lRP’‘^) of the null vector 
e = ^{a+rf) in the larger pseudo-euclidean space , 

where = span{a,rf\ for cj^ = 1 = — Whereas, effectively, we 
are only extending the euclidean space by the null vector e, it 

is advantageous to work in the geometric algebra of the non- 

degenerate pseudo-euclidean space iRP+^’^+^. We give here the important 
properties of the reciprocal null vectors e = + %]) and e = a — rj that 

will be needed later, and their relationship to the hyperbolic unit bivector 
u := ar]. 

= 0, e-e = 1, u = eAe = crArj, = 1. (25) 

The affine plane := Ae{lRP’‘^) is defined by 

AeilRP’^) = {xh = x + e\ xe } C 1RP+^’^+\ (26) 

for the null vector e G lR^d_ The affine plane Ae{lR^’^) has the nice 
property that x\ = for all xg G Ae{TdP’^), thus preserving the metric 
structure of IR^’^. We can restate definition (26) of Ae{3fP’‘^) in the form 

AeilRP’^) = {y\ ye iR^+i’^+i ^ y.e = l and ye = 0 } C 

This form of the definition is interesting because it brings us closer to 
the definition of the n = {p g)-dimensional projective plane . 

The projective n-plane H"^ can be defined to be the set of all points 
of the affine plane Ae{lBP'^), taken together with idealized points at 
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infinity. Each point G Ae{lRP'^) is called a homogeneous representant 
of the corresponding point in II"' because it satisfies the property that 
Xh-e = 1. To bring these different viewpoints closer together, points in 
the affine plane Ae{hR^’'^) will also be represented by rays in the space 

y e iRP+b5+i,y.e = 0, ye / 0 } C IRP+^’‘^+\ 

(27) 

The set of rays gives another definition of the affine n-plane, 

because each ray {y}ray £ Ae '^^^ determines the unique homoge- 
neous point 

Vh = ^e Ae{M>'^). 
y-e 

Conversely, each point y G Ae{M'P''^) determines a unique ray {y}ray in 
Thus, the affine plane of homogeneous points Ae{SF''^) is 
equivalent to the affine plane of rays A!'e‘^^^ 

Suppose that we are given /c-points aj',a 2 , ^ Ae{lBP'^) where 

each = Oi + e for Ui G IRP'^. Taking the outer product or join of these 
points gives the projective {k — l)-plane G II". Expanding the outer 
product gives 

A’^ = Ao 2 A . . . AOfc = A(o 2 - o^jAag A . . . Aa^ 

= Og A(o 2 - ag)A(a| - o^jAa^ A . . . Aa^ = . . . 

= aj- A( o 2 - ai)A(a3 - 02)A . . . A(afe - ak-i), 
or 

A’^ = Og Ao 2 A . . . Aa^ = oiAa2A . . . Ao^-h 

eA(a2 - ai)A(o3 - 02)A . . . A(afc - Ofc-i). (28) 

Whereas (28) represents a (/c — l)-plane in II", it also belongs to the 
affine (p, g)-plane Ae’'^, and thus contains important metrical informa- 
tion. Dotting this equation with e, we find that 

= e-(a3 Aa^A . . . Ao^) = (02 — ai)A(a3 — 02)A . . . A(ofc — afc_i). 

This result motivates the following 

Definition 21 The direeted eontent of the {k — l)-simplex 

A^ = Og Ao 2 A • • • AUfc 

in the affine (p,q) -plane is given by 

e-A^ e-(a3 Ao 2 A . . . Aa^) 

(A:-!)! ^ (A - 1)! 

_ (fl2 - oi)A(a3 - 02)A . . . A(afc - a^-i) 

(A-1)! 
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21.1 Example 

Many incidence relations can be expressed in the affine plane 
which are also valid in the projective plane II"', [3, pp.263]. We will only 
give here the simplest example. 

Given are 4 coplanar points ah,bg,Ch,dh G Ae(lR^). The join and 
meet of the lines agAbh and ChCdh are given, respectively, by (agAbh) U 
(chAdh) = auAbhAch, and using (18), 

{agAbh) n (chAdh) = [I ■{ahAbh)]-{chAdh) 

where ei,C 2 are the orthonormal basis vectors of and I = e 2 AeiAe. 
Carrying out the calculations for the meet and join in terms of the 
bracket determinant (19), we find that 

{auAbh) U (chAdh) = [at, bh, Ch]jl = det{a, b}I (29) 

where I = eiAe 2 Ae and det{a, 6} := (aAb) ■ (e 2 i), and 

(afiAbh) n (cfiAdh) = det{c — d,b ~ cfag + det{c — d,c — a}bh- (30) 

Note that the meet (30) is not, in general, a homogeneous point. 
Normalizing (30), we find the homogeneous point ph G Ae^dld?) 

det{c — d,b — cjag + det{c — d,c — a}bh 
det{c — d, 6 — a} 

which is the intersection of the lines ahAbg and cgAdh- The meet can 
also be solved for directly in the affine plane by noting that 

Ph ~ Olp(^h T (1 Olp)bh — PpCfi T (1 Pp)dh 

and solving to get ap = [bh,Ch,dh]j/[bh - ah,Ch,dh]j. Other simple 
examples can be found in [15]. 

22. Conformal Geometry and the Horosphere 

The conformal geometry of a pseudo-Euclidean space can be linearized 
by considering the horosphere in a pseudo-Euclidean space of two dimen- 
sions higher. We begin by defining the horosphere in by 

moving up from the affine plane Ae’'^ := Ae{lRP’^)- 

22.1 The horosphere 

Let = gen{]RF^^’^^^) be the geometric algebra of 

and recall the definition (26) of the affine plane A^’‘^ := C 
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Any point y G can be written in the form y = 

X + ae + pie, where x G and a, P G M. 

The horosphere TPp^ is most directly defined by 

:= {xc = Xh + pe \ Xh G Ap'^ and x^ = 0.} (31) 



With the help of (25), the condition that 

x^ = {xh + /3e)^ = + 2/3 = 0 

gives us immediately that P := Thus each point Xc G TLp^ has the 
form 

1 

Xc = Xh- ye = X + e - ye = -Xhexh- (32) 

The last equality on the right follows from 

]-Xhexh = ]-[{xh-e)xh + {xh/\e)xh] =Xh~ ^x|e. 



From (32), we easily calculate 

X^ 

Xc ■ yc = {x + e - —e) • (y + e - ye) = 



2 2 



-\{x-yf, 



where (x — y)^ is the square of the pseudo-euclidean distance between 
the conformal representants Xc and yc- We see that the pseudo-euclidean 
structure is preserved in the form of the inner product Xc • yc on the 
horosphere. 

Just as Xh G Ap‘^ is called the homogeneous representant of x G 
the point Xc is called the eonformal representant of both the points Xh G 
Ap^ and X G The set of all conformal representants := 

c{]RP''^) is called the horosphere . The horosphere is a non-linear 
model of both the affine plane and the pseudo-euclidean space 
The horosphere TC^ for the Euclidean space iR” was first introduced by 
F.A. Wachter, a student of Gauss, [7], and has been recently finding 
many diverse applications [3], [5]. 

The set of all null vectors y G make up the null eone 



A7:= {y G iR^+i’''+i| y^ = 0}. 



The subset of A7 containing all the representants y G {xcjray for any 
X G is defined to be the set 



A/q — {y G A7 I y-e A 0} — Ua,gjfjp,9 {Xc}ray: 
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and is called the restricted null cone. The conformal representant of a 
null ray {z}ray is the representant y G {z}ray which satisfies y-e = 1. 

The horosphere is the parabolic section of the restricted null 
cone, 

= {y g AAo I y-e = 1}, 

see Figure 5. Thus has dimension n = p + q. The null cone M is 
determined by the condition = 0, which taking differentials gives 

y-dy = 0 Xc-dy = 0, (33) 

where {y}ray = {xc}ray Since A/q is an (n + l)-dimensional surface, 
then (33) is a condition necessary and sufficient for a vector v to belong 
to the tangent space to the restricted null cone at the point y 

V G T(A/o) Xc-v = t) . (34) 

It follows that the (n + l)-pseudoscalar ly of the tangent space to A/q at 
the point y can be defined by ly = I Xc where I is the pseudoscalar of 

j^p+l,q+l_ 

Xc-v = 0 0 = I{xc-v) = {Ixc)Av = lyAv, (35) 

a relationship that we have already met in (24). 



22.2 H-twistors 



Let us define an h-twistor to be a rotor Sx G Spinp.,.;^ 



S'x := 1 + ]jxe = exp (^xe) 



(36) 



An h-twistor is an equivalence class of two “twistor” components from 
Gpy that have many twistor-like properties. The point Xc is generated 
from Oc = e by 

Xc = SxcSx\ (37) 

and the tangent space to the horosphere at the point Xc is generated 
from dx G IRP''^ by 

dxc = dSxcSx"^ + SxcdSx"^ = Sx{Cls-e)Sx'^ = SxdxSx"^ ■ (38) 



It also keeps unchanged the “point at infinity” e 



e = 



SxeSx^- 



H-twistors were defined and studied in [15], and more details can be 
found therein. 
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Figure 5. The restricted null cone and representations of the point x in affine space 
and on the horosphere. 



Since the group of isometries in A/q is a double covering of the group of 
conformal transformations CoUp^q in and the group is a 

double covering of the group of orthogonal transformations 0{p+l,q+l), 
it follows that is a four-fold covering of Cbup^q, [10, p.220], 

[14, p.l46]. 



22.3 Matrix representation 

We have seen in (14) that the algebra is isomor- 

phic to a 2 X 2 matrix algebra over the module Gp,q- This identification 
makes possible a very elegant treatment of the so-called Vahlen matrices 
[10, 11, 4, 14], 

Recall in section 1.4, that the idempotents u± = ^(1 ± u) of the 
algebra Qi^\ satisfy the properties 

u+ + U- = 1 , u+ — U- = u , u+u- = 0 = U-U+ , au+ = U-a , 



where 



and 



1 _ 1 _ 

u := eAe , u+ = -ee , U- = -ee , 
’ + 2 2 



ue = e = —eu , eu = e = —ue , cJtt+ = e , 2au. 



= e . 
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G = (l O') rt+[G] = Auj^ + Bu+a + C*U-a + D*u- (39) 
where 

[G] = ^ for A,B,C,D gGp,,. 

The matrix [G] denotes the matrix corresponding to the multivector G. 

The operation of reversion of multivectors translates into the following 
transpose-like matrix operation: 

if [G] = then [G]^= [G^] = ^ 

where A = A*^ is the Clifford eonjugation, [15]. 

22.4 Mobius transformations 

We have seen in (37) that the point Xc G T~lp,q can be written in the 
form, Xc = SxeSx^ ■ More generally, any conformal transformation f{x) 
can be represented on the horosphere by 

f(x)c = Sf(x)eSf(^x)^. (40) 

Using the matrix representation (39), for a general multivector G G 
we find that 



[GeGt] 



fA B\ fO 0\ (D B\ 

[c d) Vi o; V^ a) 



= (o)t b) 

where 



(41) 



[c] 





[Gjt 





The relationship (41) suggests defining the eonformal h-twistor of the 
multivector G G to be 
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which may also be identified with the multivector Gc ■= Ge = Buj^ + 
D*e. The conjugate of the conformal h-twistor is then naturally defined 
by 



[G]t := {D B) . 



Conformal h-twistors give us a powerful tool for manipulating the confor- 
mal representant and conformal transformations much more efficiently. 
For example, since Xc in (37) is generated by the conformal h-twistor 
it follows that 



w = [sjjsji = (1 -x) = (j . 



We can now write the conformal transformation 
form, 




(40) in its spinorial 



Since = RSx for the constant vector R G , its spinorial 

form is given by 



= [H|[SJ, = (p o) (l) 



f Ax -|- B\ 

VCx + d) 




where 




It follows that 



for constants A, B,G,D G Gp,q- 



[T.] = 





H = N and f{x) = MN~\ (42) 



The beautiful linear fractional expression for the conformal transfor- 
mation /(x), 

f{x) = {Ax + B){Cx + D)-^ (43) 

is a direct consequence of (42), [15]. 

The linear fractional expression (43) extends to any dimension and 
signature the well-known Mobius transformations in the complex plane. 
The components A, B, C, D of [i?j are subject to the condition that R G 
Pmp+i^g+i. Conformal h-twistors are a generalization to any dimension 
and any signature of the familiar 2-component spinors over the complex 
numbers, and the 4-component twistors. Penrose’s twistor theory [13] 
has been discussed in the framework of Clifford algebra by a number of 
authors, for example see [1], [2, pp75-92]. 
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Abstract This paper presents a general framework for efficiently indexing and 
searching large collections of multimedia documents by content. Among 
the multimedia information retrieval scenarios that fit into this frame- 
work are music, audio, image and 3D object retrieval. Combining the 
technique of inverted files with methods from group theory we obtain 
space efficient indexing structures as well as time efficient search pro- 
cedures for content-based and fault-tolerant search in multimedia data. 
Several prototypic applications are discussed demonstrating the capa- 
bilities of our new technique. 

Keywords: multimedia retrieval, content-based retrieval, music and audio retrieval. 

1. Introduction 

The last few years have seen an increasing importance of multimedia 
databases for a wide range of applications. As one major reason, the 
availability of affordable high-performance hardware now allows for ef- 
ficient processing and storage of the huge amounts of data which arise, 
e.g., in video, image, or audio applications. A key philosophy to access- 
ing data in multimedia databases is content-based retrieval ([Yoshitaka 
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and Ichikawa, 1999]), where the content of the multimedia documents is 
processed rather than just some textual annotation describing the doc- 
uments. Hence, a content-based query to an image database asking for 
all images showing a certain person would basically rely on a suitable 
feature extraction mechanism to scan images for occurrences of that per- 
son. On the other hand, a classical query based on additional textual 
information would rely on the existence of a suitable textual annotation 
of the contents of all images. Unfortunately, in most cases such an anno- 
tation is neither available nor may it be easily extracted automatically, 
emphasizing the demand for feasible content-based retrieval methods. 

It turns out that many content-based retrieval problems share essen- 
tial structural properties. We briefly sketch two of those problems. 

Let us first consider a problem from music information retrieval. As- 
sume that a music database consists of a collection of scores. That is, 
each database document is a score representation of a piece of music 
containing the notes of that piece as well as additional information such 
as meter or tempo. Now we consider the following database search prob- 
lem: Given a melody or, more generally, an arbitrary excerpt of a piece 
of music, we are looking for all occurrences of that query or slight vari- 
ations thereof in the database documents. The result of such a query 
could for example help music professionals to discover plagiarism. A sim- 
pler, yet very active application area is to name a tune that is whistled 
or hummed into a microphone. As a query result, a user could expect 
information on title, composer, and consumer information on an audio 
CD containing the corresponding piece of music. 

The second problem is concerned with content-based image retrieval. 
Consider a database consisting of digital copyrighted images. Assume 
that we are interested, e.g., for some legal reasons, in finding all web 
pages on the Internet containing at least one of the copyrighted images or 
fragments thereof. This problem may be again considered as a database 
search problem: Given a (query) image taken from some web page, we 
are looking for an occurrence of that image as a subimage of one of the 
database images, including the case that the query matches one of those 
images as a whole. Extensions to this problem include that we are also 
interested in finding rotated, resized, or lower-quality versions of the 
original images. 

Both of the above problems may be viewed in the following general 
setting: A query to a database consisting of a collection of multimedia 
documents has to be answered in the sense that certain transformations 
(or generalized shift operations) have to be found which transport the 
query to its location within the database document. In this tutorial pa- 
per, we systematically exploit this principle for the case that admissible 
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transformations are taken from a group acting on a set which in turn con- 
stitute the database documents. It turns out that this approach leads to 
very efficient search algorithms for a large class of content-based search 
problems, which are in particular applicable to spatial-, temporal-, or 
spatio-temporal retrieval settings ([Yoshitaka and Ichikawa, 1999]). 

We briefly summarize the main contributions of our approach: 

■ We develop a general framework for retrieval of multimedia doc- 
uments by example. Our technique’s flexibility has been demon- 
strated by prototypes in various fields (e.g., music, audio, image, 
and (relational) object retrieval). 

■ We propose generic algorithms for query evaluation together with 
efficient algorithms for fault-tolerant retrieval which consequently 
exploit the structure inherent in the retrieval problems. 

■ In contrast to previously reported approaches, query evaluation 
becomes more effieient when the complexity of a query increases. 

■ The concept of partial matches (i.e., a query is only matched to 
a part of a document) is an integral part of our technique and 
requires no additional storage. 

The proposed technique has been successfully tested on a variety of 
content-based retrieval problems. We summarize some figures on those 
prototypes: 

■ Our PROMS system, for the first time, allowed for efficient poly- 
phonic search in polyphonic scores ([Clausen et ah, 2000]). E.g., 
queries to a database of 12,000 pieces of music containing 33 mil- 
lion notes can be answered in about 50 milliseconds. 

■ Our system for searching large databases of audio signals allows 
for both identifying and precisely locating short fragments of au- 
dio signals w.r.t. the database. The sizes of our search indexes 
are very small, e.g., only 1:1,000-1:15,000 the size of the original 
audio data depending on the required retrieval granularity. As an 
example, a database of 180 GB of audio material can be indexed 
using about 50 MB only, while still allowing audio signals of only 
several seconds of length to be located within fractions of seconds 
([Ribbrock and Kurth, 2002]). 

■ Index construction may be performed very efficiently. For the 
PROMS system, index construction takes only a few minutes, 
hence allowing for indexing on the fly. Indexing PCM audio data 
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may be performed several times faster than real-time on standard 
PC hardware. 

■ In our prototypic image retrieval system containing 3,300 images, 
exact sub-image queries require about 50ms response time ([Roder, 
2002]). The search index is compressed to 1:6 compared to the 
original (JPEG) data. 

The paper is organized as follows. In the next section, we discuss sev- 
eral motivating examples and give an informal overview on the concepts 
of our approach to content-based multimedia retrieval. Those concepts 
are introduced formally in Sections 3 and 4. Section 3 deals with a for- 
mal specification of documents, queries, the notion of matches, and the 
derivation of fast retrieval algorithms. Section 4 introduces two gen- 
eral mechanisms to incorporate fault tolerance: mismatches and fuzzy 
queries. In Section 5 we present prototypic applications from the fields 
of music-, audio-, image-, and object retrieval. Finally, Section 6 gives a 
brief overview on related work and suggest some future research direc- 
tions. 

2. Motivating Examples 

The goal of this section is to present three motivating examples that 
illustrate the desirability of a unified approach to content-based retrieval. 
The first two are concerned with text retrieval tasks, whereas the third 
discusses content-based retrieval in score-based music data. 

As a first example, suppose we have a collection T = (Ti, . . . , T/v) of 
text documents each consisting of a finite sequence of words where each 
word is contained in a set (i.e., a dictionary) W of all admissible words. 
Taking a coarse level of granularity, each text document is preprocessed 
in order to extract a set of terms occurring in that document. If T 
denotes the set of all conceivable terms, then this preprocessing step 
produces a sequence V = {D \, . . . , Dm) of finite subsets of T, where Di 
is the set of terms extracted from the ith document. We may think of 
the term set T as a reduced version of IT, where unimportant words have 
been left out and verbs, nouns, etc. in IT have been replaced by their 
principal forms, e.g., by means of a stemming algorithm (e.g., sitting ^ 
sit). 

Next, let us think about content-based queries. A usual way of formu- 
lating a query is to input a finite set Q of terms to the retrieval system. 
Then one possible task is to compute the set Hx>{Q) of all exact partial 
matches, which is the set of all documents containing all of the terms 
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specified by Q. More formally, with [1 : N] := {1, . . . , N}, 

Hv{Q) := {f G [1 : iV] I g C A}- (1) 

Note that this is a rather strict decision about the Q-relevance of a 
document: if just one q G Q does not occur in Di, then A is already 
considered irrelevant. A more realistic way to estimate relevance would 
be a ranking of the documents along the quotients \Q CDi\/\Q\ G [0, 1]. 
This quotient equals 1 if and only if Q is a subset of A- 

From the viewpoint of logic, presenting a set Q = {qi, ■ ■ ■ iq-n} of 
terms to the retrieval system is equivalent to the boolean query q\ A 
. . . Aqn which asks for all documents that contain all these terms. More 
generally, we can form boolean expressions like {ti A t2) V (fs A the 
latter asking for all documents containing both the terms ti and t2 or 
the term but not the term ^ 4 . 

After this brief introduction to content-based queries and different 
text retrieval tasks, we are now going to discuss efficient algorithms for 
text retrieval. For each term t G U^^A one establishes a so-called 
inverted file, which is the (linearly ordered) set := {i G [1 : A"] | 

t G A}- Then, for a query Q C T, the set of all exact partial matches 
is obtained by intersecting the inverted files of all elements in Q: 

Hv{Q) = n Hv{t). 

t&Q 

This generalizes to boolean queries. For example, A t2) V (ts A 

-■^4)) = n H'D{t2)) U (H-riih) \ ^©(g))- 

The above discussion immediately leads to the following computa- 
tional problems: given (linearly ordered) finite sets A and B of integers, 
compute their intersection, union and difference, again linearly ordered. 
If a and b denote the cardinality of A and B, respectively, then Af] B, 
ACB and A\B may be computed with at most a-\-b comparisons using 
a merge technique. If a <C 6 then comparing the elements of A one after 
another by means of the binary search method (see, e.g., [Cormen et ah, 
1990]) with the elements of B causes at most a log 6 comparisons to solve 
each of these problems. 

So far we have discussed a rather coarse notion of a match. If for 
example the document ID i is an exact partial match w.r.t. the query Q, 
then we only know that all terms in Q do also occur in the set of terms 
extracted from the fth document. However, we do not know where these 
terms occur in the original text document Tj. This in turn is the main 
goal of full-text retrieval, where questions are allowed that ask, e.g., for 
a document containing the text passage ”to be or not to be.” We discuss 
this kind of retrieval next. 
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In our second example we consider full-text retrieval where we use 
a finer level of granularity. For this, we consider the word sequences 
constituting a text document. If W denotes the set of all words then 
the ith document is viewed as a sequence Di = {wio, . . . , Wim) over W. 
To make this example more comparable to the previous one, we will 
identify this sequence with the set {[j,Wij] \ j G [0 : rii]}. (Note that 
replacing sequences by their corresponding sets means a switch from an 
implicit to an explicit specification of the words’ places.) Similarly, a 
query like “to be or not to be” would then be identified with the set 
{[0, to], [1, be], [2, or], [3, not], [4, to], [5, be]}. 

In general, for a query Q = {qo, ...,qn) = {[j, qj] ] j G [0 : n]}, we are 
looking for the set Hx>{Q) of all pairs (t, i) such that t + Q '■= {[t+j, qj] \ 
} G [0 : n]| is a subset of D^. To obtain all those pairs efficiently we use, 
in analogy to the first example, an inverted file Hx>{w) := \ w = 

Wij}, for each word w that occurs in any of the documents. We claim 
that 

Hv{Q)=nl^o{Hv{qk)-k), (2) 

wheie Hx>{qk) - k := {{j-k,i) \ (j,i) G Hx>{qk)}- In fact, (j,i) G Hx>{Q) 
iff (go, ■■■ ,qn) = {wij, , Wi^j+n)- This is equivalent to (j + k,i) G 
Hx>{qk)i for all /c G [0 : n], i.e., (}, i) G Hx>{qk) ~ k, for all /c, which 
proves our claim. Note that we use square brackets (such as in [l,be]) 
to denote the elementary objects constituting our documents in order to 
avoid confusion with elements of the inverted files (such as (j, i) above) 
which are denoted by round brackets. 

According to this formula, the list of all solutions is again the inter- 
section of all relevant and properly adjusted inverted files. However, for 
stop words like the, a, is, to, be, the lists are rather long. To avoid long 
lists and to improve the query response time substantially, we consider 
instead of W the set of all pairs IT 2 as our new universe of elementary 
data objects. For each pair [u, rc] of words we generate the inverted file 
Hx>{v,w) := {(j, i) 1 [u,rc] = [tCjj, tCij+i]}. On the one hand, this simple 
trick drastically increases the number of inverted files, on the other hand, 
the new lists are typically much smaller than the original inverted files. 
The query “to be or not to be” can now be processed by the following 
intersection (check this!): 

Hj){to, be) n not) — 2) n {Hx>{to, be) — 4). 

Thus instead of intersecting six long lists when working with IT, we now 
have to intersect only three small lists. Obviously, this generalizes to 
/c-tuples of words, for any k. 
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After these rather classical examples let us now turn to our third ex- 
ample which deals with content-based music information retrieval. The 
reader should notice the analogy to full-text retrieval. 




Figure 1. A part of J.S. Bach’s Fugue in C major, BWV 846, in the piano roll 
notation. 

Suppose we have a collection T = (Ti,...,Tjv) of scores. That is, 
each Tj is a score representation of a piece of music containing the notes 
of that piece as well as additional information such as meter, tempo or 
specifications of dynamics. Musical scores in the conventional staff no- 
tation may be visualized using the so-called piano roll representation. 
Fig. 1 shows the piano roll representation of the beginning of Johann 
Sebastian Bach’s Fugue in C major, BWV 846. In this figure, the hor- 
izontal axis represents time whereas the vertical axis describes pitches. 
Each rectangle of width d located (w.r.t. its left lower corner) at coor- 
dinates (t,p) represents a note of pitch p, onset time t, and duration 
d. Such a note will be denoted by the triple \p,t,d]. (After a suitable 
quantization, we can assume w.l.o.g. that p, t and d are integers.) 




Figure 2. A query to the database in the piano roll notation. 

Now we consider the following music information retrieval task which 
has already been sketched in the introduction: given a fragment of a 
melody or, more generally, an arbitrary excerpt of a piece of music, 
we are looking for all occurrences of that content-based query in the 
collection. As an example consider the query depicted in Fig. 2. In 
fact, this query is a part of the fugue’s theme. Assume that we are 
looking for all positions where this theme or a pitch-transposed version 
thereof occurs. Then, Fig. 3 shows all occurrences of the query within 
the excerpt of the fugue given in Fig. 1. 
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Figure 3. Bach fugue of Fig. 1. All occurrences of the query in Fig. 2 are highlighted. 

To solve such matching problems, we first have to decide which parts 
of the complete score information are actually needed. By our experi- 
ence, the most important parameters for recognizing a piece of music are 
pitch and onset time. In other words, a person will recognize a piece of 
music even if it is played in staccato throughout, i.e., disregarding the 
notes’ durations. According to this experimental result, the first step is 
to extract from each score the pitches and onset times. This results in a 
new collection T> = {D \, . . . , Djg) of finite subsets D{ of Z^. A content- 
based query is — after a possible preprocessing step — again a finite subset 
Q of Z^. The set Hx>{Q) of all exact partial matches — when time- and 
pitch-shifts are allowed — is thus given by 

Hv{Q) := {(vr, r, i) G Z^ x [1 : A^] | (vr, r) + Q C A}, 

where {tt ,t) + Q := {[t: + p,T + t] \ [p, t] G Q}. If we construct for every 
pair \p,t] G Z^ an inverted file H-ri{\p,t]) := {(7r,r, i) G Z^ x [1 : A"] | 
[n + p,T + t] G Di}, then 

Hv{Q)= f| Hv{\p,t]). 

\p,t]£Q 

As in the case of text retrieval, the set of all exact partial matches is 
the intersection of all inverted files corresponding to the elements of Q. 
Obviously, there are infinitely many inverted files. Fortunately, from just 
one list one can recover the other inverted files according to the formula 

Hv{\p,t]) = Hv{[0,0])-{p,t), (3) 

where Hjy{[0,0]) - {p,t) := {(tt - p,r -t,i) \ G i^D([0,0])}. In 

fact, (7T,r, i) G Hx>{[p-,t]) iff [tt -|- p, r -|- t] G A iff {tt + p,T + t,i) G 
Hx>{[0,0]) iff {iT,T,i) G //x>([0,0]) — (p, f), which proves our claim. Due 
to this formula, we only need to store the inverted file //x>([0,0]). The 
other lists can easily be computed from it if required. Remarkably, if 
//x)([0,0]) is linearly ordered, then so is flx)([0,0]) — (p,t). 
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Let us analyze the computational cost of computing all exact partial 
matches w.r.t. the query Q using the formula 

Ht,{Q)= f| (77^([0,0])-(p,t)). (4) 

[p,i]eQ 

As the list iLx>([0, 0]) contains information about all notes from all scores 
in the collection, its size is proportional to the total number of notes in 
the collection. At least for large corpora this leads to unacceptable query 
response times. So we have to look for alternatives to get shorter lists. 

Although Formula (4) looks very similar to Formula (2), there exists 
a signihcant difference concerning the number of independent inverted 
files: in our music retrieval scenario this number is one whereas in the 
full-text retrieval scenario this number equals the number of different 
terms in all documents. Thus text retrieval has the advantage of many 
independent inverted files that are typically small as compared to the 
storage requirements of the whole collection. Hence it is advantageous 
to make sure that many independent inverted files exist also in the music 
retrieval scenario. 

There are different ways to achieve this goal. The hrst way is to 
reconsider the durations of the notes, resulting in documents Di and Q 
over where the additional component represents the note duration. 
Using shifts (vr, r) + Q ■= {[tt + p,T + t^d] \ \p,t,d] G Q} and inverted 
files t, d]) := {(vr, r, i) G Z^ x [1 : A^] | [tt -\- p,T -\- 1, d] G Di}, we 

note that now 



Hv{[p, t, d\) = Hv{[0, 0, d]) - {p, t). (5) 

Hence, the number of independent inverted lists equals the number of 
different durations occuring in the score collection. Besides fundamental 
difficulties of quantifying the duration, e.g., for staccato or grace notes, 
there will be extremely long lists for whole, half, quarter and eighth 
notes and these lists will typically be needed in most queries. So this 
alternative will generally not be suitable. 

A second way of obtaining shorter independent inverted files is to 
exploit a user’s prior knowledge. To avoid technicalities, assume that 
all pieces of the score collection are in 4/4 meter. Assume furthermore 
that each meter is subdivided into 16 metrical positions. When a user 
posing a query knows about the metrical position of the query, e.g., that 
the query starts at an offbeat, this may be incorporated as follows. We 
work with modified inverted files of the form 

■= {{P,k,i) I [p -h 7T, 16A: -h r] G A} 
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and note that = H!p{[0,T + 16£]) — (tt,— 0) (Check this!). 

Thus, instead of one independent inverted file we now have 16 indepen- 
dent files 

H!p{[0,t]) := {{n,k,i) \ [Tr,16k + t] € A} 

for t G [0 : 15]. For a query Q containing correct indications of the 
metrical positions, our task is to compute the set H!p{Q) := {{p,k,i) \ 
(p,16k) + Q C Di} which is equal to the intersection of all q 

ranging over all elements of Q. 

However, this only works if the user provides the retrieval system with 
accurate metrical information. If, in addition, the user knows about the 
exact pitches, then assuming 128 different pitches (like in the MIDI 
format), we now have 2048 independent inverted files 

■= {(P^kc) I [p,16k + t] G Di} 

for \p,t] G [0 : 127] x [0 : 15]. In this case, a user has to know both 
the exact pitches and the metrical positions. So this second alterna- 
tive requires a high degree of user knowledge. One could decrease this 
requirement by incorporating fault tolerance. We briefly discuss two 
mechanisms for achieving this: mismatches and fuzzy queries. 

The first mechanism considers the case that a query is not completely 
contained in a document Di. The elements of Q \ Di are called mis- 
matches. Using a variant of an algorithm (see Section 4) for computing 
the ratios \Qr\Di\/\Q\, one may efficiently determine all documents with 
at most k mismatches. 

Another possibility is to allow a user to pose fuzzy queries. In our 
music scenario, it could be the case that one is unsure about a certain 
pitch interval or the exact rhythm. Here, a user can specify alternatives 
for one note. Then a fuzzy query is a sequence F = (A, . . . , A) of n 
finite sets F) of alternatives. Such an F is the shorthand for a family of 
ordinary queries Q = {qi, ■ ■ ■ ,qn} where for each i, qi is allowed to take 
arbitrary values of F). A document Di is an exact partial match for a 
fuzzy query F A Q C Di, for some Q in this bunch of ordinary queries 
corresponding to F. If the F) are pairwise disjoint, then F consists of 
nr=i 1-^*1 ordinary queries. Although this number might grow rapidly, 
there are efficient algorithms to compute all matches, see Section 4. 

Finally, let us discuss a third alternative. Instead of notes \p, t] we now 
consider pairs of notes [p^^^]) as our basic objects. As a query Q 

in our score scenario typically refers to a contiguous part of the score, 
we do not need to store all pairs of Di x Di but only those which are 
close to another. Noticing that most rows of a score correspond to one 
particular voice each, the storage complexity is still linear in the total 
length of the collection. 
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Figure 4- Replacing a document D (left) with a subset of D2 (right) consisting of 
neighboring objects of D. Elements of D2 are specified by pairs of notes joined by an 
arrow. 



So documents Di and queries Q are replaced by suitable subsets of Dj2 
and Q2, respectively. Figure 4 illustrates this procedure. Considering 
pairs of notes is advantageous: we now have many independent inverted 
files: these are indexed by the pairs ([0,0], \p,f\) with p G [—127 : 127] 
and t G Z. For details see the next section. 

Prepared by these motivating examples, we are going to describe a 
general concept for multimedia information retrieval. This concept, 
worked out in detail in the next section, combines classical full-text 
retrieval techniques with methods from group theory. 

3. General Concept 

Let M be a set whose elements model the elementary data objects. 
A document over M is just a finite subset D of M. A collection (or data 
base) over M is a finite sequence V = (Hi, . . . , Dpj) of documents. 

In the classical text retrieval example, M = T is the set of all terms, 
whereas Di equals the set of terms extracted from the ith text document. 
In the full-text retrieval scenario, M = Z x IF is the set of all words 
together with positional information. The ith text document Tj is viewed 
as a sequence of words. This in turn may be described by the subset 
I i ^ [0 : nj\} of M. Finally, in our score retrieval 
example, M = [0 : 127] x Z or M = ([0 : 127] x Z)2 is the set of all 
(pairs of) notes. The ith score document is then a finite subset of M. 
(For technical reasons, in this last example we will replace M by the 
supersets 1? and (Z^)2, respectively.) 

A content-based query — after a possible preprocessing step — is again 
a finite subset Q of M. Hence, in our three examples, Q is a set of terms, 
a sequence of words, or a set of (pairs of) notes, respectively. 

To define more generally shifted versions of a query Q we use the 
concept of a group G acting on a set M. Recall that the (multiplicative) 
group G acts on the set M if there is a map G x M 3 {g,m) ^ gm G M 
satisfying for all g,h G G and m G M: g{hm) = {gh)m and lam = rn. 
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In this case, M is also called a G-set. Such a group action defines an 
equivalence relation on M: 

m ^ m! iff G G : gm = m' . 

The equivalence class containing m G M is the so-called G- orbit 

Gm := {gm \ g G G}. 

Hence the G-set M decomposes into the disjoint union of G-orbits. If R 
is a transversal of the G-orbits, i.e., a set of representatives containing 
exactly one element of each orbit, then 

M = y Gr. 

r&R 

By definition, the G-action on M is transitive if there is only one G-orbit, 
otherwise the G-action is intransitive. 

If G acts on M then G also acts on P{M) := {Q \ Q C M}, the power 
set of M, via 

gQ := {gq\qe Q}, 

where g G G and Q C M. As Q and gQ have the same cardinality, 
G acts on the set of all finite subsets of M, i.e., on the documents and 
queries over M. Thus the group elements precisely allow us to specify 
what is meant by a G-shift of a query. Furthermore, as we will see below, 
we can vary the group G to exploit the user’s prior knowledge. With 
this concept we can define a first information retrieval task. 

Definition 1 (Exaet partial {G,V)-matehes) 

Let V = (Di, . . . , Dm) be a eolleetion over the G-set M . For a query Q 
over M the set of all exaet {G,'D)-matehes is defined as 

Gv{Q) := {(<7, i) GGx[l-.N]\gQC DJ. • 

In the text retrieval example, G is the trivial group. Hence there is 
no loss of information if we simply replace {g,i) G Gx>{Q) by i. Thus 
Gx>{Q) = {f G [1 : A"] I Q C Di} equals the set Hx>{Q) from equation (1). 

In the full-text retrieval example, the additive group G = Z acts on 
Z X IT by {t,[j,w]) ^ [t + j,w]. In this case, Gx>{Q) consists of all 
pairs (t, i) such that the time-shifted version t -|- Q of Q is completely 
contained in Di. 

Finally, in the score retrieval scenario, the additive group G = acts 
on M = by vector addition. By the way, this action is transitive. 
Again, Gx>{Q) is the set of all pairs {{p,t),i) such that (p,t) -|- Q is 
completely contained in the ith document. 
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So far, in all examples a commutative group is acting. Here is a 
classical example of a noncommutative group action. 

A 2D color image I is modeled as a finite subset of x C, where 
[{x,y),c] G I describes the color information c G C of I at position 
(x,y) G M^. Let G denote the group of similarity transformations in M^. 
This noncommutative group is generated by rotations, translations and 
uniform scalings. If Q C x C* is a fragment of a 2D color image, then G 
acts on such fragments of images by gQ := {[g{x,y), c] \ [{x,y),c] G Q}. 
Thus gQ is a rotated, translated and rescaled version of Q. 

If D is a collection over the G-set M then our index will consist of 
(G, V)-inverted files or lists, defined by 

Gv{m) := {{g,i) G G x [1 : N] \ gm G A}, 

for m G M. One easily shows that for a query Q the corresponding set 
of all exact partial (G, D)-matches may be computed as the intersection 
of all (G, D)-inverted files specified by Q: 

GviQ) = n Gv{q). 

qeQ 

Thus to obtain all matches, it suffices to have access to all inverted lists 
Gx>((?). If M or G are infinite or of large finite cardinality, it might 
be impossible or impractical to store all inverted lists. However, in a 
number of cases, we can overcome this problem. The crucial observation 
is that the inverted lists of all elements in one G-orbit are closely related. 

Lemma 2 Gx>{gm) = Gx>{m)g~^ := {{hg~^,i) \ {h,i) G Gx>(m)}. 

Proof. The following chain of equivalences proves our claim: 

{h,i) G Gx>{m) hm G Di hg~^{gm) G Di 

{hg~^,i) G Gv{gm). • 

Thus if the multiplication in the group G is not too involved, we can 
quickly recover Gx>{gm) from GT>{m), which might lead to dramatic 
storage savings. To be more precise, let i? be a transversal of the G- 
orbits of M. Then every element m G M can be written as m = gm'f'm 
with a uniquely determined Vm G R and an element gm G G which is 
unique modulo Gm ■= {d ^ G \ gm = m}, the stabilizer subgroup of 
m. By the last lemma it is sufficient to store only the inverted lists 
corresponding to the transversal R. Then the remaining lists can be 
computed on demand according to the following result. 
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Theorem 3 For the set of all exaet partial {G,V)-matehes w.r.t. the 
query Q C M the following formula holds: 

GviQ) = n Gv{rq)gf\ 

q&Q 

Proof. Combine Gx>{Q) = ngggGx)(g) and q = gqUq with the last 
lemma. • 

How time and space consuming is the computation of all exact par- 
tial (G, P)-matches along the formula of the last theorem? Let us first 
discuss storage requirements. 

Theorem 4 With the notation of the last theorem suppose that the sta- 
bilizer subgroups Gm for all m G M are trivial. Then the sum of the 
lengths of all {G,V)- inverted lists eorresponding to a transversal R of 
the G-orbits of M equals the sum of the eardinalities of all doeuments: 

^ length (G© (r) ) = ^ |A|- 
reR ie[l:A'‘] 

Proof. As all stabilizers are trivial, each m G M has a unique decom- 
position m = gm^m with g^ G G and G R- Thus each m G Di 
contributes exactly one entry to exactly one list: {gm,i) £ Gx>{rm)- • 
lim G Di has a nontrivial stabilizer Gm and if m = gmf with gm G G 
and r G R, then m contributes exactly \Gm\ entries to the inverted 
list Gx>{m), namely all pairs of the form (gmg,i) with g G Gr- So if 
all stabilizers are small (|Gm| < c, say) then a slight modification of 
the above reasoning shows that the above equality can be replaced by 
X^rg/jlength(Gx)(r)) < \Di\. If there are large stabilizers, then 

the above formula should not be applied directly. Instead, a redesign of 
the set M of elementary data objects might be helpful to force trivial 
stabilizers. This trick will be discussed below. 

In what follows we concentrate on the case that all stabilizers are 
trivial. How fast can we compute an intersection like ngggGx)(rg)gr“^? 
To settle this question let us first discuss the problem of generating the 
inverted lists. To accelerate the intersection task, we first define a lin- 
ear ordering on the group G and suppose that we have an algorithm 
that efficiently decides for g,h G G whether g = h, g < h, or g > h. 
For the moment, we also assume that we know a transversal R of the 
G-orbits of M and that we know an efficient algorithm for computing 
M 3 m {gm,rm) G G X R, where m = gm^m- Under these assump- 
tions we may, for each element in all the documents, efficiently compute 
the corresponding inverted file: m = gm^m G Di yields the entry {gm,i) 
in Gj){rm)- Finally, we linearly order the entries in each inverted file 
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according to their first component, which is an element of the linearly 
ordered group G. Note that not all G-orbits have to be involved in the 
documents of a particular database Di, . . . , Djy. Suppose that R' C R is 
minimal with U^g/j/Gr D then we have a total of |i?'| inverted 

files. If the group operation is compatible with the linear ordering of the 
group, i.e., x < y implies xg < yg, for all x,y,g G G, then the linear 
ordering of the inverted files Gx>(r) for r G R can be used directly when 
computing Gt>(Q) using the formula Gx>{Q) = r\q^QGx>{rq)g~^ . Accord- 
ing to our assumption, each Gx>{rg) is linearly ordered, and hence so is 
GT>{rq)g~^ . Consequently, we have to compute the intersection of n lin- 
early ordered lists of length G < ■ ■ ■ < £n, say. If A* is the cardinality of 
the intersection of the first i lists, then computing Gx>{Q) with the binary 
method requires at most G logl'j+i < (n — 1)G login comparisons 

in G as well as at most n inversions and G + In multiplications in 
G. 

Surprisingly, a similar complexity result can even be obtained when 
the linear ordering of G is not compatible with the multiplications in G. 
This is based on the following simple but very useful observation. 

Lemma 5 Let a, b be elements and A, B subsets of the group G. Then 
Aa -L Bb = {Aab~^ T B)b = (A T Bba~^)a, for Tg {n, U, \}. 

Proof. Straightforward. • 

In the worst case, both Aa and Bb are not linearly ordered in contrast 
to A and B. Nevertheless, when computing, e.g., the intersection AaCiBb 
we can use either the linear ordering of A by computing (A n Bba~^)a 
or that of B by computing {Aab~^ (1 B)b. 

Theorem 6 Let Q be a query of size n, Q = {gi, . . . , qn}- Suppose that 
the lists Gx>{qi) are ordered aeeording to their lengths G G ■■■< In, 
where G := \Gxi{qi)\- For j G [1 : n] let Xj denote the eardinality of the 
interseetion njg[i.j]Gx)(gj). Given the linearly ordered lists Gx>{r) for 
all r G R, the list Gx>{Q) of all exaet partial {G,T))-matehes of Q ean 
be eomputed with YlllGi GlogG+i < (n — l)Glog in eomparisons in G, 
along with n deeompositions qi ^ (gi,ri) G G x R satisfying qi = giVi, n 
inversions gi ^ g~^ , n — 1 multiplieations gf^gi^i, and Xir=i 
multiplieations in G. 

Proof. Let qi = giVi with gi G G and r* G R. Then 

n n 

Gv{Q) = n Gv{qi) = n Gv{n)g-\ 

i=l i=l 
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According to the last lemma we compute Gx>(Q) as follows (here shown 
for the case n = 3 using the shorthand Hi := Gx>{ri)): 

Hig^^ n ^25-2^ ^ ^393 ^ = i{Hig^^g2 n ^^ 2 ) 5-2 ^53 n 

This formula suggests the following procedure: first, compute all de- 
compositions Qi ^ (gi,ri). Then invert all gi and compute all g~^gi^i. 
Afterwards, inductively compute linearly ordered sets 

'~ ( ri ^i9i )9j- 

Note that Ai = Hi is already linearly ordered. As Aj+i = AjgJ^gj^i n 
Hj and Hj is linearly ordered, we can compute this intersection using 
the binary method. We finish the proof by mentioning that Aj has 
cardinality Xj and Ang~^ equals Gx>{Q)- • 

Roughly speaking, the last theorem tells us that the number of oper- 
ations to compute Gx>{Q) is at most the product of the cardinality of Q, 
the length of the shortest list and the logarithm of the longest list among 
all lists corresponding to Q. Hence it is profitable to have many short 
lists. But what do we do if G acts transitively on M? In this case one 
should look at intransitive G-sets, closely related to M. Here are two ex- 
amples: if n > 2 then G acts on V<n{M) := {X C M | 0 < \X\ < n} by 
gX := {gx \ x € X} and on M"- by g{mi, . . . ,m.n) := {gmi, . . . ,gmn)- 
Both actions are intransitive. The next result shows a close connection 
between exact partial matches in various G-sets. 

Theorem 7 LetV = (Di, . . . , Dj\f) denote a doeument eolleetion over 
the G-set M. This induees new eolleetions := , . . . , D"^). Simi- 

larly, for a query Q C M and V<n(T>) := (V<n{Di), . . . ,V<n{D]sf)) we 
assoeiate the queries V<n{Q) T V<n{M) and C M”. The eorre- 
sponding sets of exaet partial matehes are equal: 

GviQ) = Gr,uv)i'P<nm = Gv-{Q^. 

Proof. For {g, i) G G x [1 : A"] we have 

{g,i)eGv{Q) gQTDi 

^ VX G V<n{gQ) = gV<n{Q) : X g P<n(A) 
^ gV<n{Q) c V<n{Di) 

{g, i) G G-P^^ (V){V<n{Q))- 

This proves the first equality. The second equality follows in a similar 
way using (gQ)"' = gQ^- • 
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It should be clear that the above intransitive G-actions are just two 
generic constructions which are available in every case. Depending on a 
particular application one will possibly succeed with smaller G-sets, as, 
e.g., the action on Vn{M) = {X C M \ \X\ = n} instead of V<n{M) = 
{X C M I 0 < |X| < n}. 

Here are some specific examples. G = (M^,+) acts transitively on 
M := by vector addition. The induced actions of G on both V 2 {M) 
and M2 are intransitive. In fact, the G-orbit of {x,y} G V 2 {M) is 
characterized by ±(x — y), whereas the G-orbit of {x,y) G M2 is char- 
acterized by re — y. Thus if the documents consist of finite subsets Di of 
and the group of all translations is the group in question, then it is 
profitable to view pairs or 2-sets of vectors in as the new elementary 
data objects. If, however, the group Gs of all similarity transformation in 
3-space (generated by all translations, rotations and uniform scalings) 
is the group of choice, then M2 decomposes only into two G^-orbits 
represented by ([0, 0, 0], [0, 0, 0]) and ([0, 0, 0], [0, 0, 1]), respectively. Fur- 
thermore, Gs acts transitively on V 2 {M). However, V^^M) decomposes 
into many G<j-orbits. These orbits correspond to the classes of congruent 
triangles in Euclidean 3-space. Thus if G = G^ then one should switch 
from M = to V^{M). Besides many G«-orbits by this switch we 
also obtain small stabilizers. In fact, all stabilizers are isomorphic to a 
subgroup of the dihedral group of order 6. 

A switch from M to Vk{M) has the drawback that a document Di C 
M with di elements has to be replaced by the new document Vk{Di) 
which consists of ('^*) elements. Fortunately, in most applications a 
query Q will typically be not a random but a structured subset of M. 
For example, if (M, d) is a metric space and 0 > 0 a prescribed constant, 
a possible property of Q could be its ^-connectedness, i.e., there is an 
enumeration oi Q, Q = {gi, . . . , qn}, such that d{qi-i, qi) < 6, for all i G 
[2 : n] . Thus instead of Vk{Di) in this case it is sufficient to work with the 
typically much smaller set Di^^^y := {X G Vk{Di) \ X is ^-connected}. A 
query Q = {gi, . . . , with 0 < d{qi-i,qi) < 6 should then be replaced 
by Q' = • • • ,g(A+i)fc} I 0 < A < [n/k\} U {{qn-k+i, ■ ■ ■ ,qn}}- 

4. Fault Tolerance 

So far, we have mainly discussed exact partial matches. Typically, 
there are many sources of impreciseness. Think, e.g., of a non profes- 
sional user humming a melody into a microphone. Another example 
could be the search in a database of signals where the queries consist of 
noisy or lossy compressed versions of the original signals. In all of those 
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cases, a certain degree of fault tolerance is required. Now we will discuss 
two general kinds of fault tolerance: mismatches and fuzzy queries. 

We start with mismatches. Let Q be a query and D a document over 
M. The elements of QCiD form the matching part, whereas the elements 
in Q that do not belong to D are called the mismatches. Thus for a fixed 
non-negative integer k and a query Q the set 

GUQ) ■■= {(5, i) G G X [1 : N] I \ A| < k} 

specifies all partial (G, T*)-matches with at most k mismatches. Obvi- 
ously, G^{Q) equals Gx>{Q). For A: > 0 and {g,i) G G^{Q) the trans- 
formed query gQ is contained in Di with up to k mismatching elements. 
Fig. 5 illustrates the concept of mismatches. 




Figure 5. A query Q (left) is matched to a certain location of a document D (middle) 
using a shift by g. On the right, the matching positions D n gQ are represented by 
gray boxes, whereas the two mismatches gQ \ D are marked by dashed boxes. 

To determine G^{Q), we use a dynamic programming approach. Let 
Q = {o'!) • • • ) In} and define Gj := Gx>{qj) as well as Fj := Gi U . . . U Gj. 
We inductively define credit functions Cj: Fj ^ Z as follows. Fi( 7 ) := 
A: -|- 1, for every 7 G Fi. For 2 < j < n we define 

( Gj_i( 7) if7GFj_inGj 

G,(7):=<^ G,-i( 7)-1 if 7 GF,_i\G, 

[ k + 2-j if7GG, \F,_i. 

Theorem 8 The elements o/F„ with a positive credit are just all partial 
{G,T>)- matches with at most k mismatches: 

GUQ) = {7 G FJ G„(7) > 0}. 

Proof. Let (g,i) G Fj and Qj := {qi, . . . ,qj}. By induction on j we 
show that Cj{g,i) = k + 1 — \gQj \ L)j|. The start j = 1 is clear. To 
prove the inductive step (j — 1 ^ j) we distinguish three cases. 

Case 1: {g,i) G Fj_i n Gj. As gqg G A, gQj \ A = gQj-i \ Di. 
Thus Cj{g,i) ■= Gj-i{g,i) = k + l- \gQj-i \ Di\ = k + 1 - \gQj \ A|- 
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Case 2 : (g,i) e Tj_i\Gj. As gqj 0 A, \gQj\Di\ = \gQj-i\Di\ -\-l. 
Thus Cj{g,i) := Cj-i{g,i) - 1 = k -\- 1 - {\gQj-i \ Di\ -\- 1) = k -\- 1 - 

\gQj \ A|- 

Case 3: (g,i) £ Gj \ Tj-i. Then gqj £ Di, but gq^ 0 A, for all 
I £ [1 : j — 1]. Hence \gQj \ A| = J — 1- Thus Gj{g^ i) \= k -\- 2 — j = 
k + l-{j-l) = k+l - \gQj \ A|- 

As Qn = <3, we get Gn{g, i) = k -\- 1 - \gQ \ Di\, for every {g, i) £ T^. 
Thus Cn{g,i) > 0 iff \gQ\Di\ < k, i.e., {g,i) £ • 

To perform a complexity analysis of fc-mismatch search, we first note 
that Gj{'y) < Cj-i{'y), for all 7 G Tj_i. Hence in a practical implemen- 
tation we can replace Tj by T-^ := Tj \ {7 G Tj_i | Cj_i( 7 ) < 0}. Then 
Tj = A, for all j G [1 : /c -h 1] and T1 C T2 C . . . C T^+^ D T^+2 D . . . D 
T”. Let ij and gj denote the lengths of Gj and A, respectively. With 
Lemma (3.5) and a similar technique as in the proof of Theorem (3.6) 
we can compute G^{Q) with at most Y^j^i min{£j_|_i log Rj, Rj logj_|_i} < 
Llog;Ufc+i comparisons, where L denotes the total length of all involved 
lists. In addition, we have to perform 2n -|- multi- 

plications or inversions in the group. 

Let us now turn to fuzzy queries. Recall that a fuzzy query over M 
consists of a sequence F = (A, . . . , A) of n finite sets A of alternatives. 
We associate to such an F a family of ordinary queries 

Q(F) := {{gi, ...,qn}\yi: qi £ A}. 

The set Gx>(F) := {{g,i) \ 3Q £ Q(F) : gQ C A} specifies the set of all 
exact partial (G, T*)-matches w.r.t. the fuzzy query F. Obviously, Gx>(F) 
is the union of all Gx>(Q), where Q ranges over all ordinary queries asso- 
ciated to F. As already mentioned, if the A are pairwise disjoint, then 
F consists of 01*=! 1-^*1 many ordinary queries. So, the naive algorithm 
that separately computes sets Gv{Q) for each Q £ Q(F) and finally 
merges those sets is rather inefficient. Fortunately, the following result 
indicates an efficient algorithm to compute Gx>(F). 

Theorem 9 Let V = (Di, . . . , Dn) be a eolleetion of doeuments over 
the G-set M. //F = (A, • • • > A) is a sequenee of subsets of M then the 
set Gx>(F) of all exaet partial {G,'D)-matehes w.r.t. the fuzzy query F 
may be eomputed using the following formula 

Gv{F)= f| (|jG^(g))= f| {\J Gv{r,)g-^). 
ie[l:n] q&Fj j'e[l:n] q&Fj 

Proof. (g,i) £ Gx>(F) iff gQ C Di, for some Q £ Q{F). Equivalently, 
there exists an element qj £ Fj satisfying gqj £ Di, for all j G [1 : n], 
i.e., (g,i) £ Cgj^FjGx>{qj), for all j, proving our claim. • 
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The complexity analysis for computing Gx>(F) is straightforward and 
left to the reader. It turns out that Gx>(F) may be computed with a 
number of comparisons which is — modulo logarithmic factors — linear in 
the total number of entries in all involved lists. 

To incorporate prior user knowledge, we consider subgroups U < G. 
Then, for a transversal ii 9 Ic of C/’s cosets in G we have G = U^g/jC/r. 
Using this decomposition, G- and [/-inverted lists are connected by 

= |_| Uv{rm)r, 

r&R 

for each m G M. Then, a speed-up in query processing when restricting 
ourselves to subgroups U < G may result from the fundamental property 

Gv{Q) = |_| Uv{rQ)r, 

r&R 

for all Q C M. Hence, Gti{Q) consists of many lists of the form Ux>{rQ)r. 
Now, assuming prior knowledge about the coset of a match g ^ G w.r.t. 
U, as in the above example where a user knows about the exact metrical 
position of a query, we only need to determine the list for the case r = Iq, 
i.e., UviQ). 

5. Applications, Prototypes, and Test Results 

In this section we give an overview on several applications of the pro- 
posed indexing and search technique. A more detailed treatment for 
the case of music retrieval may be found in our related work ([Clausen 
and Kurth, 2003]). We describe prototypic implementations and give 
some test results demonstrating time- and space-efficiency of the pro- 
posed algorithms. Recall that for each application we have to specify an 
underlying set M of elementary objects, a group G operating on this set, 
and a transversal i? C M of G-orbits to specify the inverted file index 
{Gv{r) \ r £ R}. 

5.1 Content— Based Music Retrieval 

Score-based polyphonic search has been used as a running example 
within this paper. In our PROMS-system ([Clausen et ah, 2000]) we 
used the set M := Z x [0 : 127] of notes consisting of onset-times and 
MIDI-pitches. The search is carried out w.r.t. the groups G := (Z,-|-) 
and V := (16Z,-|-) of time-shifts, where G shifts by metrical positions 
and V by whole measures each consisting of 16 metrical positions. As 
discussed above, the latter models prior knowledge about the metrical 
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position of a query within a measure. The corresponding transversals R 
are defined as above. 

Our database consists of 12,000 classical pieces of music given in the 
MIDI format. The pieces consist of a total of about 33 million notes. 
Response times for queries of various lengths are summarized in Ta- 
ble 1. The response times for each query length were averaged over 100 
randomly generated queries. As the table demonstrates, our query pro- 
cessing is very fast. In addition, the index requires only a small amount 
of disk space and indexing of 330 MB of polyphonic music takes only 
40 seconds. The uncompressed index requires 110 MB of disk space. 
Compressing the inverted lists using Golomb coding results in reducing 
the space requirement to 22 MB. 



a 


4 


8 


12 


16 


20 


30 


50 


100 


b“ 


51 


86 


92 


97 


100 


107 


125 


159 


c 


1 


5 


7 


10 


12 


19 


31 


64 



Table 1. Average total system response time (row b) in ms for different numbers 
of notes per query (row a). Row c: Disk access time for fetching inverted lists. 
(Pentium II, 333 MHz, 256 MB RAM). 



5.2 Audio Identification 

The task of audio identification may be described as follows. Given 
a short part q of an audio track and a database xi, . . . , xat of full-size 
audio tracks, locate all occurrences of q within the database tracks. In 
this, a pair (t, i) determines an occurrence oi q m Xi q = Xi[t : t -\-\q\ — 
1]. Note that this problem may be stated in terms of our group-based 
approach since an audio signal s : Z ^ M may be interpreted via its 
graph as a subset 5 C Z x M where G = (Z, -b) operates by addition in 
the Z- (time) component. For robustness- and space-efficiency reasons, 
audio signals are preprocessed using a G-invariant feature extractor F : 
— > 'P(Z X X), where X denotes a set of feature classes (in this 
case G-invariance denotes the usual time-invariance). As an illustration 
we sketch a feature extractor which extracts significant local maxima 
from a smoothed version of a signal. For a detailed treatment of more 
robust feature extractors, we refer to [Ribbrock and Kurth, 2002]. F will 
be composed from several elementary operators which are each maps 
on the signal space, i.e., maps ^ M^. First, an input signal s is 
smoothed by linear filtering. The corresponding operator is G/ [s] : n 
^^g^/(fc)s(n — k), where / denotes a signal of finite support. Next, 





50 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



-fT-significant local maxima are extracted by an operator 

{ x(n) if x{n — K) < ■ ■ ■ < x{n) A 
x(n) > • • • > x(n + K), 

0 otherwise. 

The resulting signal is again processed by an operator M'^, extracting 
local maxima. M'j^, is defined exactly as Mk, but regards only the sup- 
port of the input signal. M'^, usually returns a very sparse output signal. 
The operator A assigns to each non-zero position of an input sequence 
the distance to the previous nonzero position (provided existence), and 
zero otherwise. Finally, let Q\x\ denote an, e.g., linear, quantizer which 
reduces a signal’s amplitude to |A| feature classes. Note that Q\x\ is 
an operator V{1^ x X) which in addition to quantization discards 

zero-positions of a signal. Then our feature extractor may be written as 
F := T>c o A o M'j^, o Mx o Cf. In an example we could choose K = 5, 
K' = 3, and A = [1 : 50]. To construct our search index, we calculate 
F[xj\ C Z X A for each signal Xi of our database. In this {[0, x] | x G A} 
serves as a transversal for index construction. 

We briefly summarize some results of our extensive tests in the au- 
dio identification scenario. Our database consists of 4500 full-size audio 
tracks. This approximately amounts to 180 GB of original data or 13 
days of high quality audio. Using the above significant maxima as fea- 
tures, we obtain an (uncompressed) index size of about 128 MB which 
is a compression ratio of about 1:1,400 as compared to the original data. 
Using different feature extraction methods, the index size may be further 
reduced to sizes of 1:5,000 or even lower. The query times range from 
only a few milliseconds (higher quality queries) to about one second. 
The required length of a query signal depends on the feature extractor 
and ranges from a few fractions of a second (significant maxima features) 
to 5-15 seconds (robust features and low quality queries) ([Clausen and 
Kurth, 2003]). 

5.3 Content-Based Image Retrieval 

In content-based 2D- or 3D-retrieval we are interested in finding pos- 
sibly translated, rotated, or (uniformly) scaled versions of a query object 
in an underlying database. In this overview we shall consider transla- 
tions and rotations only. Hence, the groups of interest are the group 
Tn of translations in M"", the orthogonal group On-, and the group of 
Euclidean motions £n '■= Tn x On- 

In content-based image retrieval, we are working with 2D images D C 
X N = P and are hence interested in the groups T 2 and £ 2 - Assume a 
suitable feature extractor yielding a set of features F{D) C x A for an 
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image D. T 2 acts on P’s first two components as described above. Hence, 
after feature extraction, we may create an index based on the transversal 
{[0,0, x] I X G X} C P. In our extensive tests described in [Roder, 
2002] we investigated several kinds of feature extractors including corner 
detectors, gray value statistics and histograms. 

Looking at retrieval under the group £2 acting on 2D points from P, 
we face the problem that each point x G is mapped to any other 
point y G by infinitely many elements from £ 2 , resulting in inverted 
lists of infinite size. Hence we resort to indexing line segments, which 
are modeled by the set V 2 {M) := |L C M | |L| = 2} of all two- 
element subsets of M^. Using M' := ^ 2 ( 1 ^^) x X the line segments 
{[(0, a), (0, — a), x] I X G X, o > 0} serve as a transversal for indexing. 
In [Roder, 2002], several types of features were tested in the latter set- 
ting. 

5.4 Searching 3D-Scenes 

We investigated content-based search in 3D scenes for the case of 
a database of VRML (Virtual Reality Modeling Language) documents 
([Mosig, 2001]). To obtain feasible inverted lists, the elementary objects 
were chosen to be all 3-sets in M^, i.e., M = P 3 (M^), interpreted as the 
sets of all triangles in M^. Hence for indexing, all VRML documents 
were converted into documents consisting of triangles only. Indexing 
was performed for the groups Pa of 3D translations and for the group £3 
of Euclidean motions in M^. As a transversal of the Ts-orbits we chose 
all sets of triangles with the origin as the center of gravity. Additionally, 
each representative is rotated such that one specific edge runs in parallel 
to the x-axis, this edge depending on the triangle having one, two, or 
three different side lengths. This way, one obtains a finite set of inverted 
lists for this application. 

Fig. 6 illustrates the concept of 3D-retrieval using an underlying toy- 
database of 3D-objects (on the left). The top of the figure shows a part 
of an object which is used as a query to the database. An index-based 
search results in one object of the database matching the query (on the 
right). The position of that object matching the query is highlighted. 

6. Related Work and Future Research 

The techniques presented in this paper are related to work from across 
several communities. In this section we try to establish the most impor- 
tant relations to previous as well as ongoing research efforts. 

From a classical database point of view, multimedia data may be 
modeled using a relational framework. In the usual approach, mul- 
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Figure 6. Toy database of 3D-objects (left), query object (top), and matching 
database object (right). The matching position is highlighted. 



timedia documents are preprocessed yielding certain feature vectors. 
The extracted features are then suitably stored in tables of a relational 
database ([Santini and Gupta, 2002]). Using relations, it is possible to 
model complex object dependencies like spatial constraints on regions 
in an image. Efficient retrieval methods have been proposed using ap- 
proximate search like hill climbing ([Papadias, 2000]). Although our 
approach may be extended to a relational setting by introducing per- 
mutation groups, the methods proposed in this paper were primarily 
developed to exploit the structure of the underlying object set M and 
the group G acting on M (e.g., musical documents are structured by 
specific time- and pitch- intervals between single notes which are not 
changed by the group action). In this light, our model constitutes a 
special case of the general relational setting which, however, allows for 
very efficient query evaluation. 

An important issue in multimedia indexing is the use of multidimen- 
sional access structures like k-d- or R*- Trees ([Lu, 2002]). Popular in- 
dexing approaches map multimedia documents such as time series to 
higher dimensional feature sequences and use multidimensional access 
structures for searching in those structures ([Faloutsos, 1996]). In our 
approach we, as far as possible, tried to avoid higher dimensional features 
in order to avoid problems resulting from the dimensionality curse. In 
indexing audio this became, e.g., possible by exploiting the fixed tempo- 
ral relationships between the features. On the other hand, when dealing 
with more complex groups (e.g., the group of Euclidean motions in 3D 
leads to a 6-parameter representation for each element), our approach is 
also dependent on efficient algorithms for higher dimensional range and 
nearest neighbor search ([Agarwal, 1997]). 

Specializing to time-series, there has been considerable recent interest 
in searching a query sequence in large sets of times-series w.r.t. various 
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distance measures. Examples are Euclidean, or dynamic time warp- 
ing distances ([Keogh, 2002, Moon et ah, 2002]). Whereas our approach 
has up to now only been applied to audio signals, its high performance 
suggests an extension to include general time-series search under the 
latter distance measures. For the particular case of audio identification, 
several methods suitable for large data collections have been proposed 
recently. Among those, the hashing algorithms proposed by ([Haitsma 
et ah, 2001]) are the most similar to our approach, as hash tables are 
related to inverted files. Our approach has the advantage of needing sig- 
nificantly less memory for storing the index (about 100 MB as compared 
to 1-2 GB) with otherwise comparably (fast) performance data. 

In music retrieval, most of the early work has concentrated on sim- 
ilarity based retrieval of melodies, see, e.g., [Uitdenbogerd and Zobel, 
1999]. For a long time, retrieval in polyphonic music (see, e.g., [Lem- 
strom and Perttu, 2000]) already suffered from a lack of suitable data 
modeling. Our technique led to a breakthrough in allowing to model 
as well as efficiently search polyphonic music ([Clausen et ah, 2000]). 
As our approach is up to now mainly focused on modeling and efficient 
retrieval, the use of music similarity measures is a natural challenge for 
future work. 

An approach which is similar in spirit to our general technique is 
geometric hashing for object recognition proposed in Computer Vision, 
see [Wolfson and Rigoutsos, 1997]. This approach shares the modeling 
of shift operations which are considered for 2D/3D settings as well as 
the exploitation of the data’s structural (geometric) properties. In our 
approach the data modeling is more general, which yields advantages in 
designing more efficient fault-tolerant retrieval algorithms. 

An extension of our approach ([Clausen and Mosig, 2003]) including 
general distance measures between query and matching position leads 
to (shape-) matching problems which have been extensively treated in 
the area of computational geometry ([Veltkamp, 2001]). It will be a 
great challenge for future work to try to combine our approach, which is 
more tuned for use with larger datasets, with the sophisticated geometric 
matching techniques. 

Many approaches to content-based image retrieval have been proposed 
in the last years, among which perhaps IBM’s QBIC system (Query by 
Image Content) is the most popular. In comparison to those, a consid- 
erable advantage of our technique is the natural integration of partial 
matches and the ability to locate queries as subimages of database im- 
ages at no additional cost. An interesting direction is the Blobworld 
approach by [Carson et ah, 1999], of region-based image representation 
and retrieval, where image descriptors are created from a prior segmen- 
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tation into similarly textured regions. It would be very interesting to 

combine such texture descriptors with our approach to spatial indexing. 
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1. Introduction 

Radar, short for RAdio Detection And Ranging, was invented almost 
a century ago by Christian Hulsmeyer in Diisseldorf, Germany. His 
patent for telemobiloskop. No. 165,546, issued 30 April 1904, was a 
collision prevention device for ships. However, Robert Watson-Watt is 
often given credit for inventing radar. Thirty years later, researchers 
were experimenting with radio transmission and reception at the Naval 
Research Laboratory in Washington DC. The experiment set up was such 
that the communications stations were on opposite sides of the Potomac 
River. Interference occurred each time a ship passed between these two 
communication stations, and proved to be a reliable indicator of an 
object present regardless of weather conditions. Since that time, great 
strides in radar have brought us air traffic control, airborne synthetic 
aperture radar for crop production assessment, and much more. 

Radars operate via transmission through a directional antenna. Ob- 
jects within the field of view scatter radio frequency (RF) energy in all 
directions, some towards the receiving antenna. Reflections from the 
clutter are undesired, and may mask signals from targets. 

Throughout the twentieth century, great progress in and numerous 
applications of radar emerged as being commercially and militarily suc- 
cessful. Initially, investigations into electromagnetic theory supported 
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experimentation with radio wires. Then, in the 1930’s, interest in the Ul- 
tra High Frequency (UHF) (300 MHz) band resulted in the first surveil- 
lance and fire control systems. By the 1940’s, microwave radars operat- 
ing in the 500 MHz to 10 GHz band were developed. This resulted in 
long range search and track radars by the late 1950’s. 

Advances in waveform and signal processing technology ushered in a 
new era starting in the 1960’s. The first airborne radars were devel- 
oped. Phased array technology made low sidelobe antennas available 
for the Airborne Warning And Control System (AWACS) wide area 
surveillance platform. Sidelobe cancellers emerged as the first adap- 
tive signal processing technology designed to mitigate electromagnetic 
interference and jammers. Generalization of the sidelobe canceller con- 
cept, in conjunction with multi-channel phased array technology, led 
to space-time adaptive processing algorithms, architectures, and system 
concepts by the 1980’s. During this same time, space-based radar (SBR) 
emerged as technically feasible for wide area surveillance for theater wide 
surveillance. As the 20th century came to a close, advances in digital 
technology, computing architectures and software, and solid state radio 
frequency devices offered some of the most exciting opportunities for 
fielding new radars with previously unheard of capabilities. All of this 
leads to the four challenge problems in radar discussed below. 

2. Radar Fundamentals 

In Figure 1, a simplified block diagram presents the fundamental 
building blocks and inter-connectivity essential to the functioning of a 
radar. Here, a classical system using a reflector antenna is presented, 
while modern systems use phased arrays and multi-channel receivers. 

In a radar system, the modulator generates a low power signal which 
drives the transmitter. In the transmitter, the signal may be converted 
in frequency from baseband, and is amplified, often to many kilowatts 
of peak power. The duplexer protects the receiver during transmit, and 
directs backscattered energy to the receiver. The antenna focuses trans- 
mit energy into narrow beams to localize in angle, as well as intercept 
returns from targets on receive. The receiver amplifies the return signal 
in order to overcome component noise, down converts the radar signal to 
a low intermediate frequency (1 MHz), match filters the radar returns, 
envelope detects the signal and digitizes it. The synchronizer is used 
for waveform generation, timing, and control of the transmit pulse, and 
measures range to the target on receive. The signal processor is designed 
to separate target returns from clutter and other interference, and es- 
timate target parameters. The tracker further processes radar returns 
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Figure 1. Basic Radar Architecture. 



to present target history and predict future position for display to the 
operator. 

The radar range equation is used to size a system to mission require- 
ments. In its simplest form, the range equation starts with transmit 
power Pf, at a distance R from the radar with mainbeam transmit an- 
tenna gain Gt, the power density is 



PtGt 

The power intercepted by a target of radar cross section a and re- 
radiated towards the radar is 

PtGta 

47Ti?2 • 



The power density of the target return at the radar is 



The received power is 



where 



PtGta 

(47ri?2)2 



PtGtcrAr 

(47ri?2)2 



Ar — 



X^Gr 

47T 



( 3 ) 

( 4 ) 

( 5 ) 
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is the receive aperture at the radar. The result is 

PtGtGA^a 

(47t)^ 



( 6 ) 



The received power, can be written in terms of signal-to-noise 
ratio (SNR), S/N ^ and thermal noise power kToENp, where k is Boltz- 
man’s constant, Tq is the noise temperature of the radar, B is the noise 
bandwidth of the radar receiver, and Np \s its noise figure. Substituting 
in to Equation (6), we get 

P = {S/N) KTqBNf = , (7) 

^ ’ (47r)3p4 ^ ^ 



This equation must be satisfied to achieve detection performance com- 
mensurate with a given SNR. In addition to noise figure {Np), numerous 
other losses, L, degrade the detection performance of a radar. They in- 
clude hardware losses, atmospheric effects, and beam and filter shape 
losses, etc. The maximum range of a radar may then be computed using 
the formula 

4 ^ PtGtGrX^a 

{4TrfKToLBNp{S/N\^^' 



In conventional radar, the output of the receiver is detection processed 
and compared to an adaptive threshold to determine target present (hy- 
pothesis Hi) or target absent (hypothesis Hq). From here, target decla- 
rations are handed off to the tracker for further analysis and handoff to 
the fighter/interceptor. 



3. Radar Waveforms 

In airborne early warning radar for wide area surveillance, a number 
of tradeoffs must be considered before waveform parameters are selected. 
Among them are continuous wave or gated-wave modes of operation. In 
gated-wave operation, the pulse width and the pulse repetition rate must 
be selected to be comparable with the target characteristics as well as the 
radar hardware available to the systems engineer. Since Fourier analysis 
is a standard tool available for radar signal analysis, the gated waveform 
should be composed of a number of pulses regularly repeated at a high 
enough rate to meet the Nyquist sampling criterion. However, too high 
of a repetition rate causes other problems, most notably, ambiguities in 
range. With range ambiguities, distant target returns may be masked by 
very strong close-in clutter. As such, the pulse repetition rate may have 
to be lowered. The tradeoffs in pulse duration, pulse repetition rate, and 
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pulse shape impact radar resolution, accuracy, and ambiguities not just 
in range, but in Doppler as well. Doppler is important in radar because it 
is proportional to target radial velocity. Through velocity measurements, 
target position and engagement procedures are established. 

Considering the tradeoff between low, medium, and high pulse rep- 
etition frequency (PRF), several issues must be discussed. Low PRF 
waveforms offer precise unambiguous measurements of range, and per- 
mit simple elevation sidelobe clutter rejection. This must be balanced 
with difficulties in resolving Doppler ambiguities, and suffer from poor 
performance against ground moving targets. 

In high PRF systems, range ambiguities compound the clutter re- 
jection problem, but the clutter free Doppler may be quite large. An 
appropriate compromise may be medium PRF, if low elevation sidelobes 
are easily incorporated into the systems designed. Also complicating the 
waveform design are pulse compression and modulation characteristics. 
The purpose of pulse compression is to permit high resolution modes 
of operation with long duration large bandwidth pulses that meet en- 
ergy on target requirements established by the radar range equation. 
Waveform diversity enabled by the plethora of possibilities in waveform 
generation, timing and control, is now commonly discussed in the litera- 
ture. Waveform diversity can encompass many aspects of the signal de- 
sign problem, including frequency division multiplexing, pseudo-random 
phase coding, and pulse compression chirp rate diversity. Similarly, this 
concept can easily be expanded to encompass both temporal and spatial 
diversity. One simple concept for waveform diversity, called spatial de- 
nial, prohibits non-cooperative receivers from interfering with mainlobe 
signals by altering the nature of the sidelobe structure. This can be ac- 
complished through the application of individual waveform generation, 
timing and control electronics at each channel in a phased array, or by 
augmenting existing equipment with auxiliary antennas (at least two) 
designed to accomplish the same mission. This is illustrated in Figure 
2. 

Spatial denial is achieved as illustrated in Figure 3. 

The effects of sidelobe modulation of the interferometric spatial denial 
antennas are to produce a broad null along the end fire axis of the 
antenna. If the two bracketing antennas are orthogonal to the orientation 
of the radar antenna, then mainlobe radar emissions are unaffected by 
the spatial denial radiation, while the radar sidelobes are completely 
obscured by this same energy. The resultant effect is to prevent the non- 
cooperative receiver from sampling the radar emissions, and cohering to 
them. In Figure 4, we illustrate the radiated waveforms that emerge 
from a waveform diverse transmit aperture. 
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Figure 2. Interferometer Pattern. 




Figure 3. Spatial Denial Antenna Pattern. 
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Figure 4- Transmit Waveform Diversity. 




Figure 5. Typical Envelope of Radar Output. 



Here, multi-mission, multi-mode waveforms are transmitted simul- 
taneously in different directions, accomplishing different objectives, all 
without mutual interference. 

4. Signal Processing 

To achieve adequate detection in noise, the radar engineer must con- 
sider the minimum detectable signal given the radar receiver character- 
istics, the required SNR given the detection criteria and the radar range 
equation. The minimum detectable signal is determined by the ability 
of a receiver to detect a weak radar return as compared to the noise en- 
ergy that occupies the same portion of the frequency band as the signal 
energy. This is illustrated in Figure 5. 

The root mean-square (RMS) value of the in-band noise establishes 
the absolute limit to sensitivity exhibited by a radar receiver. Due to 
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Figure 6. Effects of Target on Distribution Function. 

the random nature of noise, the receiver threshold level is often many 
decibels (dB) above the RMS noise level. The presence of a target, often 
modeled as a constant signal, will alter the statistics of the received 
signal by shifting the mean value of the baseband voltage waveform. 
This shift in mean value is what permits detection processing using the 
various criteria that have emerged including Neyman-Pearson, Bayes, 
etc. Figure 6 illustrates the effect of a target on the distribution of a 
received signal. 

One problem in signal processing is false alarms due to thermal noise 
fluctuations. The ultimate tradeoff is between detection probability, 
which is a function of the range equation and the threshold level of 
the receiver, and the false alarm rate, which is a function of the noise 
statistics and the threshold level. The common element is the threshold 
setting. The probability of detection for a sine wave in noise as a function 
of the signal-to-noise (power) ratio and the probability of false alarm are 
presented in Figure 7. 

The false alarm rate for wide area surveillance radars must be small 
because there are so many range, angle, Doppler biases with the potential 
for false alarms. For a 1 MHz bandwidth, there are on the order of 10® 
noise pulses per second. Hence, the false alarm probability of any pulse 
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Figure 7. Probability of Detection vs. SNR. 
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must be small, less than one in a million. Non Gaussian clutter is more 
complicated and will further compound the false alarm control problem. 
In radar, we consider the classical hypothesis testing approach to be 
most appropriate. We have Type I errors, which are false alarms, and 
Type II errors, which are missed detections. In the Neyman-Pearson 
receiver, we fix the probability of a Type I error (P/) and minimize the 
probability of a Type II error (P//). We can consider several aspects of 
the detection problem with an ideal observer, by maximizing the total 
probability of a correct decision or minimizing the probability of error 
P(e), given by 



P(e) = PnPi + Ps+nPii (9) 

where Ptv is the a priori probability of noise only present, Ps+ is the a 
priori probability of a signal plus noise crossing the threshold, and P/, 
Pji were defined above. 

In the sequential observer, we fix the probability of error a priori and 
allow for the integration gain to vary. Of course, in all of the previ- 
ous discussions, it was assumed that the likelihood ratio test is used to 
analyze measured radar data. 

If an event has actually occurred, the problem of forming the best 
estimate of the source of the event uses Bayes rule, which states 
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In signal processing, clutter is defined as unwanted radar returns from 
ground, sea, precipitation, birds, etc. The clutter parameters of interest 
in signal processing algorithm development include amplitude distribu- 
tion, spatial extent, temporal stability, radar parameter dependence, etc. 

Clutter characteristics must be understood in order to optimally se- 
lect radar waveform parameters, as well as signal processing algorithms 
designed to separate target returns from interference. Finally, the signal 
plus any residual clutter energy must be tested for target present /target 
absent. The basic assumption here is that adequate clutter rejection 
has been achieved such that statistical hypothesis testing against ther- 
mal noise is adequate. In realistic radar environments, that is seldom 
the case, and the detector circuit must be altered to adequately address 
the impact of clutter residue on detection performance. Clutter residue 
out of a simple two pulse canceller can be impacted by amplitude and 
phase errors in the radar system. Figure 8 illustrate these effects, where 
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Figure 8. Cancellation Ratio 



clutter residue (CR) is plotted as a function of RMS amplitude ripple 
(error) versus RMS phase ripple. As an example, an amplitude error 
of 0.1 dB limits the signal processor CR to less than 40 dB. If the in- 
terfering ground clutter is very strong, which it typically is in modern 
airborne radar, and exceeds the target return by more than 40 dB, then 
automatic detection processors will not be able to distinguish between 
threat targets and clutter residue, rendering the radar useless in this 
case. 

5. Space-Time Adaptive Processing 

In radar signal processing research, modern Space-Time Adaptive 
Processing (STAP) developments incorporate numerous disciplines, in- 
cluding applied statistics, linear algebra, software engineering, advanced 
computing technology, transmit receive module technology, waveform 
generation, timing and control, antennas, and system engineering. Each 
of these topics requires years of study to become an expert in any one 
area. This section focuses on a systems engineering approach to under- 
standing the need for STAP in modern radar. 

Consider an airborne wide area surveillance radar with a transmit 
mainlobe pointing broadside (perpendicular to the velocity vector of the 
airborne radar). In this example, an airborne threat target is approach- 
ing the radar in the general direction of the broad transmit mainbeam. 
Our goal is to separate the target return from ground clutter returns. 
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The spectral spread of ground clutter is governed by the equation 

/, = ^cos(0). (11) 

In this equation, fd is the Doppler offset of clutter arriving from the 
angle q, where q is measured with respect to the velocity vector of the 
airborne radar, v is the velocity of the radar platform, and A is the radar 
wavelength and is proportional to the radar frequency /c, through 



A = c//, 



(12) 



where c is the speed of light. For a given angle 9i, the clutter Doppler 
frequency is given by 

2v 

fdi = y cos(6»i) . 



If a mainlobe target is approaching the radar at radial velocity , then 
the target Doppler is 



fd2 



2v2 



If fd 2 = fdi, then sidelobe clutter from angle 9i is going to compete 
with mainlobe target traveling with radial velocity V 2 - If the sidelobe 
clutter is effectively a much larger scattering center than the mainlobe 
airborne target, then the detection process could easily produce a type II 
error and the threat target will go unreported. One approach to rejecting 
clutter competing with mainlobe targets is to lower the sidelobes of 
the antenna to the point where unwanted interference is suppressed to 
below the thermal noise floor of the receiver. Then only mainlobe clutter 
will remain, and these unwanted returns can be rejected by Doppler 
processing. However, this is unrealistic. 

In order to produce extremely low sidelobes, an antenna aperture 
would be very large. Additionally, the antenna would have to operate 
in the far field of any large scattering center. In modern airborne radar, 
this is unrealistic. 

If the radial velocity of the threat target were known a priori, then a 
receive antenna pattern and a platform velocity vector could be selected 
to place a null on the clutter that would compete with the target in the 
detection process. However, this is unrealistic, even under the simplest 
conditions. The only way to accomplish the task of detecting targets 
and rejecting interference is to adaptively place nulls in the sidelobe 
pattern to optimize performance. However, since angle-Doppler coupling 
ties sidelobe clutter position to clutter Doppler frequency offset, the 
adaptive nulling problem is not one-dimensional (1-D) (angle) but two- 
dimensional (2-D) (angle-Doppler). 
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Azimuth Projection 



Figure 9. Angle-Doppler Profile. 

In radar, angle and Doppler are derived variables. In a measurement 
system, data is collected in the space and time domains. It is impor- 
tant to note that space and angle are related by the Fourier transform, 
as are time and Doppler. This fact is exploited in STAP-based signal 
processing, and is illustrated in Figure 9. 

As illustrated in Figure 9, mainlobe clutter and targets are widely 
separated in theory. In practice, the fundamental issue is leakage sup- 
pression among Doppler filters, or antenna beams. In space-time adap- 
tive processing, the simplest approach to clutter rejection and target 
detection is through application of the sample matrix inversion (SMI) 
algorithm, which can be written as 

Irt = s^R^^xo , (13) 

where s is the steering vector in angle, Doppler, or both, xq is the 
measured data to be analyzed for target present [Hi] or target absent 
[Hq], and Rfc is the sample covariance matrix, where 

1 ^ 

Rfc = ^ X] ■ (14) 

k=l 

The training data x^, k = 1, 2, . . . , A', are selected from the immediate 
region where the data from the cell under test, xq, is collected. This helps 
to meet the independent, identically distributed assumption common in 



70 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



the literature. The measurement data, x*,, are also assumed to be zero- 
mean complex Gaussian data vectors under the null hypothesis. In this 
simple test, if the sample covariance matrix is the identity matrix I, 
then the SMI test reduces to the discrete Fourier transform. However, 
Rfc will only be an identity matrix under the assumption that the clutter 
data vectors, x^, k = 1,2, . . . , K, are all white noise vectors, and K is 
large. More realistically, is not diagonal. 

In order to simplify the processing in STAP based radar, the two- 
dimensional adaptive processing is decomposed into two one-dimensional 
processes. In radar, it is logical to perform Doppler filtering of the time- 
domain data from each channel in a STAP based radar first, and then 
apply the SMI algorithm in the spatial domain. As such, the adaptive 
degrees of freedom, i.e., the size of the steering vector s, is substantially 
reduced. Furthermore, the size of the sample covariance matrix is re- 
duced as is the number of samples required to adequately estimate the 
covariance matrix. 

In the 1980’s, Kelly from MIT Lincoln Laboratory revisited the theo- 
retical development that led to the SMI algorithm. The likelihood ratio 
test was developed under the Gaussian assumption where the data vec- 
tors were zero-mean under the null hypothesis. Additionally, the SMI 
algorithm was developed under the assumption of infinite iid training 
data, K ^ oo, for sample covariance matrix estimation, and that the 
sample covariance converges to the true covariance matrix as the number 
of samples tends towards infinity. 

In Kelly’s development, he applied one more condition to the like- 
lihood ratio test, namely finite training data K. The generalization 
resulted in a new test statistic given by 



Irt 



s'^R ^xq 

^s^R“^s^ -|- x^R“^x^ 



(15) 



Kelly’s generalized likelihood ratio test exhibits an embedded Gon- 
stant False Alarm Rate (GFAR) characteristic, meaning that the prob- 
ability of type I error is fixed. Additionally, it is important to note that 
the presence of a T2-test in the denominator prohibits the separation of 
the filter function from the false alarm control function in this formula. 
As such, the separation of filter function from false alarm control is not 
possible. A further modification to the SMI algorithm is to incorporate 
the effects of angle-Doppler coupling (structure in the covariance ma- 
trix) into its development. This could be accomplished by conditioning 
in angle-Doppler coupling in the probability P^ih.{x / H i), i = 0, 1, used 
in the formulation of the likelihood ratio test. Once this is accomplished. 
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the effects of angle-Doppler coupling on other statistical tests are easily 
established. 

6. Four Problems in Radar 

As analog hardware performance matures to a steady plateau, and 
Moore’s Law provides for a predictable improvement in throughput and 
memory, it is only the advances in signal and data processing algorithms 
that offer potential for performance improvements in fielded sensor sys- 
tems. However, it requires a revolution in system design and signal 
processing algorithms to dramatically alter the traditional architectures 
and concepts of operation. One important aspect of our current re- 
search emphasizes new and innovative sensors that are electrically small 
(on the order of 10 wavelengths or less), and operate in concert with 
a number of other electrically small sensor systems within a wide field 
of view (FOV). Our objective is to distribute the power and the aper- 
ture of the conventional wide area surveillance radar among a number 
of widely disbursed assets throughout the battlefield environment. Of 
course, we must have an algorithm for distributing those assets in real 
time as the dynamically changing surveillance demands. The mathe- 
matical challenge here relates to the traveling salesman problem. Clas- 
sically, the traveling salesman must select his route judiciously in order 
to maximize potential sales. Recent analysis in the literature addresses 
multiples salesmen covering the same territory. This is analogous to our 
problem, where multiple unmanned aerial vehicle (UAV) based sensors 
are charged with the mission of detecting, tracking, and identifying all 
targets (friend or foe). Not only must these sensors detect and identify 
threat targets, they must also process data coherently across multiple 
platforms. Our mathematical challenge problem reduces to one in which 
the position and velocity all UAV-based sensors are selected to maximize 
detection performance and coverage area, and minimize revisit rate. 

Enhancing one of the sensors described above, to be more like a clas- 
sical radar with a large power-aperture product, leads to the second 
mathematical challenge problem to be addressed by this community. 
With a larger aperture and more precise estimates of target parame- 
ters (angle, Doppler), an opportunity to expand the hypothesis testing 
problem to include both detection and estimation emerges. Here, con- 
ventional wisdom dictates that we perform filtering and false alarm rate 
control as part of the detection process, yet perform track processing 
as a post-detection analysis, where the parameter estimation is focused 
upon target position and velocity history. Clearly, parameter estimation 
need not be accomplished as a post-detection process. Since this seg- 
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merited approach to detection and track processing has been in effect for 
decades, it will require a dramatic demonstration of improvement before 
it will be embraced by the radar community. 

A third challenge problem arises in the formulation of the Generalized 
Likelihood Ratio Test (GLRT). In Kelly’s formulation of a GLRT, con- 
ditioning on finite sample support is incorporated into the basic test. As 
such, a statistical method developed under the assumption that only fi- 
nite training data are available for sample covariance matrix formulation 
was made available. The next generalization to be made, in an extension 
of Kelly’s GLRT, is to incorporate prior knowledge of the structure of 
the sample covariance matrix into the mathematical development of a 
statistical test. This mathematical structure arises due to the fact that 
the phase spectra of ground clutter as seen by an airborne radar is de- 
termined only by geometry, and remains independent of the underlying 
clutter statistics (except for initial phase). The effect of this geometric 
dependence is to localize the interference along a contour in the trans- 
form domain (Fourier analysis). Our objective is to formulate a single 
GLRT which incorporates the effects of finite training data as well as 
geometric dependence. 

The fourth mathematical challenge facing the modern radar engineer 
is to incorporate adaptivity on transmit into the basic formulation of 
the signal processing algorithm. Since this is a new research topic, an 
opportunity exists to formulate the basic mathematical framework for 
fully adaptive radar on both transmit and receive. Further extensions 
arise by incorporating the above challenge problems into this analysis. 

7. Conclusions 

The rapid recent development in signal processing and waveform gen- 
eration, timing, and control, has led to opportunities for fielding a vast 
array of new radar systems. Among these new sensor suites, we find 
that challenging problems remain unaddressed. Most notable is the in- 
corporation of prior knowledge concerning the clutter environment into 
the likelihood ratio test. This work is expected to be a simple extension 
of Kelly’s work in the 1980’s. 

Of course, extension of the thresholding process to a “post tracker” 
implementation offers further opportunities for performance enhance- 
ments and mathematical formulations. A further enhancement to radar 
is obtained through application of a wide variety of radar signals to a 
diverse number of radars, all operating in concert in a battlefield envi- 
ronment. This gives rise to a fourth opportunity in radar, that deals 
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with the fielding and positioning of these radars, much like the traveling 
salesman problem. 

All of this leads to new opportunities to exploit advances in applied 
mathematics, physics, electronics, and computer engineering to advance 
the state-of-the art in radar. While radar is nearing its 100-th year 
as a patented product, the prospects for advancement are as great as 
ever. Our defining moment in radar may have occurred in World War 
II; our impact on society is yet to occur, thanks to the mathematicians, 
scientists, and engineers who continue to advance the technology. 
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“Look, ” they say, “here is something new!” But no, it has all happened before, 
long before we have were born. 

— Good News Bible, Eccl.UlO 



Abstract In this paper we develop two topics in parallel and show their inter- 
and crossrelation. The hrst centers on general notions of the classical 
signal/system theory on finite Abelian hypergroups. The second con- 
cerns the quantum hyperharmonic analysis of quantum signals (Her- 
mitean operators associated with classical signals). We study classical 
and quantum generalized convolution hypergroup algebras of classical 
and quantum signals. 

Keywords: classical and quantum signals/systems, classical and quantum Fourier 
transforms, Clifford algebra, hypergroups. 

Introduction 

The main F. Klein idea of the “Erlangen Program” lies in the corre- 
spondence of some group to a certain geometry. Thus, a group is the 
first (basic) notion of geometry and group can be interpreted as some 
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group of symmetries for the geometry. So, in general, every group of 
transformations (symmetries) determines its own geometry under the 
F. Klein correspondence GEO = /(GROUP). Quantum signal theory 
is a term referring to a collection of ideas and partial results, loosely 
held together, which assumes that there are deep connections between 
the worlds of quantum physics and classical signal/system theory, and 
that one should try to discover and develop these connections. The gen- 
eral topic of this paper is the following idea. If some algebraic structures 
arise together in quantum theory and classical signal/system theory in 
the same context, then one should try to make sense of this for more 
generalized algebraic structures. Here, the point is not to try to de- 
velop alternative theories as substitute models for quantum physics and 
signal/system theory, but rather to develop a “/3- version” of a unified 
scheme of general classical and quantum signal/system theory based on 
the F. Klein “Erlangen Program”. It is known that general building ele- 
ments of the Classical and Quantum Signal/System Theories (Cl-SST 
and Qu-SST) are the following: 1) the Abelian group of real numbers 
AR, 2) the classical Fourier transform IF, and 3) the complex field C, 
i.e., these theories are associated with the triple ((AR,.F, C)). Following 
F. Klein, we can write 

Cl-SST = fa (((AR, T, C))) , Qu-SST = (((AR, T, C))) 

for any F. Klein correspondences fa and fqu, respectively. These cor- 
respondences mean that every triple ((AR,.7^, C)) determines certain 
theories Cl-SST and Qu-SST. In this paper we develop a new uni- 
fied approach to the Generalized Classical and Quantum Signal/ System 
Theories (GCl-SST and GQu-SST). They are based not on the triple 
((AR, but rather on other Abelian groups and hypergroups, on a 

large class of orthogonal and unitary transforms (instead of the classical 
Fourier transform), and involve other fields, rings and algebras (triplet 
color algebra, multiplet multicolor algebra, hypercomplex commutative 
algebras, Clifford algebras). In our approach. Generalized Classical and 
Quantum Signal/System Theories are two functions (correspondences) 
of a new triple: 

GCl-SST = fa[mG,T,A))), GQu-SST = /,„(((HG,.A, A))) , 

where HG is a hypergroup, A" is a unitary transform, and A is an alge- 
bra. When the triple ((HG,JF, A)) is changed the theories GCl-SST 
and GQu-SST are changed too. For example, if J- is the classical 
Fourier transform, HG is the group of real numbers R and A is the 
complex field C, then ((R, A", C)) describes free quantum particles. If 
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T is the classical Walsh transform (CWT), HG is an abelian dyadic 
group Z2 and A is the complex field C, then ((Z2 , CWT, A)) describes 
n-digital quantum registers. If T is the classical Vilenkin transform 
(CVT), HG is an abelian m-adic group Z^ and A is the complex held 
C, then ((ZJ^, CWT, C)) describes n-digital quantum m-adic registers 
and so on. Every triple generates a wide class of classical and quantum 
signal processing methods. We develop these two topics in parallel and 
show their inter- and crossrelation. We study classical and quantum 
generalized convolution hypergroup algebras of signals and Hermitian 
operators. One of the main purposes of this paper is to demonstrate 
parallelism between the generalized classical hyperharmonic analysis and 
the generalized quantum hyperharmonic analysis. 

1. Generalized classical signal/system theory on 
hypergroups 

1.1 Generalized shift operators 

The integral transforms and the signal representation associated with 
them are important concepts in applied mathematics and in signal the- 
ory. The Fourier transform is certainly the best known of the integral 
transforms and, with the Laplace transform, is also the most useful. 
Since its introduction by Fourier in the early 1800s, it has found use in 
innumerable applications. However, the Fourier transform is just one of 
many ways of signal representation, there are many other transforms of 
interest. An important aspect of many of these representations is the 
possibility to extract relevant information from a signal: the informa- 
tion that is actually present but hidden in its complex representation. 
But these transformations are not efficient analysis tools compared to 
the ordinary Fourier representation, since the latter is based on such 
useful and powerful tools of signal theory as linear and nonlinear convo- 
lutions, classical and higher-order correlations, invariance with respect 
to shift, ambiguity and Wigner distributions, etc. The other integral 
representations have no such tools. The ordinary group shift operators 
{T[x){t) := x{t © r) play the leading role in all the properties and tools 
of the Fourier transform mentioned above. In order to develop for each 
orthogonal transform a similar wide set of tools and properties as the 
Fourier transform has, we associate a family of generalized commutative 
shift operators with each orthogonal transform. Such families form com- 
mutative hypergroups. Only in particular cases are these hypergroups 
well-known Abelian groups. In 1934 F. Marty [1, 2] and H.S. Wall 
[3, 4] independently introduced the notion of hypergroup. 
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Let f{x) : O — > A be an ^-valued signal, where A is an algebra. 
Usually, n = R” x T, or 17 = Z"" x T, where R”, 7A and are nD 
vector spaces over R, Z and Ztv, respectively, T is a compact (temporal) 
subset of R, Z, or Z^r. Here, R, Z and Z^r are the real field, the ring 
of integers, and the ring of integers modulo N, respectively. Let 17* be 
the space dual to 17. The first one will be called the spectral domain, 
the second one is called the signal domain keeping the original notion of 
X G 17 as “time” and w G 17* as “frequency” . Let 

Sigo = L(17,^) := {f{x)\ f{x) : 17 — > A} , 

Spo := L(17*,^) := {F{co)\F{uj) : 17* — > A} 

be two vector spaces of ^-valued functions. In the following we assume 
that the functions satisfy certain general properties so that pathological 
cases where formulas would not hold are avoided. Let {Tui{x)} 
be an orthonormal system of functions of SigQ. Then for any function 
f{x) G SigQ there exists a function F{uj) G Spg for which the following 
equations hold: 

F{uj) =CF{f]{uj) \= ( f{x)ifoj{x)dp,{x), (1) 

J 

f{x) =CF~^{F}{x) := f F{u})(foj{t)dn{uj), (2) 

where p,{x), are certain suitable measures on the signal and spectral 
domains, respectively. The function F{lo) is called the C.F-spectrum of a 
signal f{x) and expressions (1) and (2) are called the pair of generalized 
classical Fourier transforms (or C.F-transforms). In the following we 

will use the notation /(x) < — > in order to indicate C.F-transform 

CT 

pairs. Along with the “time” and “frequency” domains we will work with 
“time-time” 17 x 17, “time-frequency” 17 x 17*, “frequency-time” 17* x 17, 
and “frequency- frequency” 17* x 17* domains, and with four distributions, 
which are denoted by double letters ff(x,u) G L2(I7 x 17, A), Ff((j,x) G 
L2(I7* X 17, A), fF(x,i/) G L2(I7 x 17*, A), and FF(w,i/) G L2(I7* x 17, A). 

The classical shift operators in the “time” and “frequency” domains 
are defined as {Tff){x) := f{x + v), {D'fF){oj) := F{ui + v). For /(x) = 
^ and 

, i.e., functions e^^^, are eigen- 

functions of “time” -shift and “frequency” -shift operators Tf and Dfj 
corresponding to eigenvalues A„ = e^^'^ and respectively. We 

now generalize this result. 
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Definition 1 The operators 

{f^ip^){x) = {f^ip^){x) = ipu,{x)ipu,{v), (3) 

{DITu,){x)= (p^{x)(py{x), {D';^(p^){x)= (p^{x)^py{x). (4) 

are ealled the generalized eommutative “time” and “frequeney”- shift op- 
erators (GSOs) respeetively. 

It is known [5, 6] that two families of time GSOs {T^j^gnand frequency 
GSOs form two commutative hypergroups. By definition, 

functions (fuj(x) are eigenfunctions of GSOs: Tfip^{x) = Tuj{'^)Tuj{x\ 
D'f(pu){x) = f>y{x)(p^{x). For this reason, we can call them the hyperehar- 
aeters of the hypergroup. The idea of a hypercharacter on a hypergroup 
encompasses characters of locally compact and finite Abelian groups and 
multiplication formulas for classical orthogonal polynomials. The the- 
ory of GSOs was initiated by Levitan [5, 6] and (in the terminology of 
hypergroup) by DuNCL [7] and Jewett [8]. The class of commutative 
generalized translation hypergroups includes the class of locally com- 
pact and finite Abelian groups and semigroups. The theory for these 
hypergroups looks much like locally compact and finite Abelian group 
theory. We will show that many well-known harmonic analysis theo- 
rems extend to the commutative hypergroups associated with arbitrary 
Fourier transforms. 

For a signal f{x) G Sigg we define its shifted copy by 



Tff{x) ■=f{xmv) = T 




F{ijj)(p^{x)dpL{ijj) = 



j = j [F{uj)g:)^{v)]g:)^{x)dfi{uj). 

Analogously, 



Tffix) : 


:=f{xBv)= f 




J 




:= F{uj ®y)= / 




J 


KF{u;) 


:= F{lv Qv) = J 



[F{uj)ip^{v)]ip^{x)dij.{uj), 

[f{x)(f„{x)](f^{x)dp,{x), 

[f{x)ip^{x)](f^{x)dfi{x). 
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Here symbols ffl,© and B,0 are the quasisums and quasi differences, 
respectively. Obviously 

(p^{x Sv) = (Pcj(x)(p^(v), (p^{x B u) = (p^(x)(p^(v), 

and 

Toj®u{x) = (Pujix)ipu(x), TujQu{x) = if^{x)lp^{x). 

We will need the following modulation operators: 

{M^f)(x) := <fu{x)f{x), {MlF)u) := (p^{v)F{uj). 

{M^f){x) := ip^{x)f{x), {M^F)iv := ip^{v)F{uj). 

From the GSOs definition we have: 

Theorem 1 Shifts and modulations are connected as follows: 

fffix) = f{xmv) ^ F{co)ipUv) = M:F{co), 

cr 

fffix) = fixBv) ^ F{co)^Uv) = M:F{co), 

cr 

Mff{x) = f{x)(f^{x) i — > F{lv eu) = D'fF{w), 

CT 

Mff{x) = f{x)(f^{x) < — > F{lv Qv) = DlF{uj), 

CT 

i.e., 



CF{ff}CF-^ = MZ, 


CF{Mf}CF-^ = DZ, 


( 5 ) 


CF{ffX-^ = MZ, 


CF{MZ}CF~^ = DZ, 


( 6 ) 


CF-^{DZ}CF = MZ, 


CX{MZ}CF = fz, 


( 7 ) 


cf~Hdz}cf = mz, 


CF-\MZ}CFo = fZ. 


( 8 ) 



The operators are noncommutative because 

Mfff = ffMf = MvWfTf, 

MX = Mv)DX, DX = Mv)MX. 
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1.2 Some popular examples of GSOs 

Example 1 In this example we eonsider GSOs on finite eyelie groups. 
Let O = Z/N be an Abelian eyelie group. The ND veetor Hilbert spaee of 
elassieal diserete A-valued signals is Sigg = {f{x)\f{x) : Z/iV — > .4}. 
The eharaeters ofL/N are diserete harmonie A-valued signals Xoj{x) = 
where uj G fiL/N)* = Z/iV, and e is a primitive Nth root in an 
algebra A. They form a unitary basis in Sigg. The Fourier transform in 
Sigo is the diserete Fourier A-valued transform 

fix) = CTf,^F}= ^ 

uj&Z/N 

F{w) = CTN{f}= Yl 

x&L/N 

All Fourier speetra form the ND veetor Hilbert speetral spaee Spg = 
{F((j) I -F(tj) : Z/N — > .4}. The “time-frequeney” and ‘frequeney- 
time” domains are LlxD,* = D,xD = Z/iV x Z/N, i.e., the phase spaee 
is the 2D diserete torus "L/N x "L/N. The “time” and “frequeney”- shift 
operators Tf, D)f are defined byTff{x) := f{x®v), DfjF{u!) := /(w©p), 
where 




and © is the symbol representing addition modulo N. It is obvious that 
CJ ^ = M”, N = Alfi'^. Here, modulation op- 

erators Mf and M” are defined by Mff{x) := e’^^fix), Mf,F{uj) := 
e“^^F{u)), where 




The “time” -shift and “frequeney ’’-shift operators induee the following 
pair of sets of noneommutative Heisenberg-Weyl operators: 

HW^ := = Mfff I 1 / G Z/N, v G Z/lv} , 
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HW^ := = MX I V e V G Z/iv} . 

They act on Sigg and Spg by the following rules: 

:= MX fix) = e-^fix © v), 

e^X'^Fiu) := MXFi^) = © ^)- 

□ 

Example 2 Let On o,nd he two versions of a finite Abelian group of 
order N := A^iA ^2 • • • -^n- The fundamental structure theorem for finite 
Abelian groups implies that we may write On and OJ^ as the direct sums 

m m 

of cyclic groups, i.e., On = 0Z/A^j, and OJ^ = 0Z*/iV/, where both 

1=1 1=1 

h/Ni and JA /Ni are identified with the integers 0, 1, . . . , — 1 under 

addition modulo Ni. Group elements x G On and uj G OJ^ are identified 
with points x = (xi,X 2 , ■ ■ ■ ,Xm) and to = (cji, tJ 2 , • • • , <^m) of the mD 
discrete torus, respectively. Let us embed finite groups On and OJ^ into 
two discrete segments On — > O := [0, N — 1], O^ — > O* := [0, N — 1]* 
using a mixed-radix number system 



X = 







The weights of x\ and uji are unity (Nq = 1.) The group addition induces 
“exotic” shifts in the segments O := [0,N — 1] and O* := [0,N — 1]*, 
which we will denote as If x = {x\, . . . ,Xm)-, v = {vi, . . . ,Vm) and 

N 

LV = {u)i , . . . ,Wm), P = {vi , . . . then 

X®V = (xi, . . . ,Xm)©('yi, ...,Vm) = (xi©i;i, ...,Xm®Vm) 

N N Nl Nrr, 



and 



LO®U= {iOi,...,UJm)®{ni,...,Um) = {iOi® Ui, . . . , (jJm ® X ■ 

N N Ni Nm 

The Fourier transforms in the space of all A-valued signals defined on 
the finite Abelian group On = Z/A^i x Z/iV 2 x . . . x Z/iV^ in the form of 
O = [0, N — 1] have a great interest for digital signal processing. Denote 
this space by Sigg = L(O,0. Let e^i be a primitive A-valued Ni-th root. 
The set of all characters of the group On can be described by Xui{x) = 
Xujiixi) ■ ■ ■ XuJmi^m) = They form an orthogonal basis 
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in the signal space L(0,^). The Fourier transform of a signal f{x) G 
L(J7, is defined as 

F{u) = CJ^n{/}(w) = X] f(^)Xco{x), uj gXI*. (9) 

ten 

The inverse Fourier transform is 

f{x) = = ^ Y1 t G n. (10) 

ojeri* 



The set of all functions F{oj) forms the spectral space Spg = L(0* ^). 
The “time” and “frequency” -shift operators Tf, are defined by 



Tff{x) := f{xev), Df,F{uj) := f{u;®n), 







N N 


where 


rpV rp{v\.,V2i- 

X ~ (xi,X2v 




and 


x> (U!1,U!2,. 


IXi) = os ® os ® ■ ■ ■ ® Kz 


Obviously, 







= m: 



Here, modulation operators Mf and M” are defined by Mff{x) := 
Xuj{x)f{x) and Mf,F{uj) := Xui{x)F{oj), where 



Mf = 

X (Xl,X2,..;Xm) 



XI X2 X777, 



and 



Mf = = M”\ ® M2 






UI2 






□ 



Example 3 Let O = [a,b], O* = {0,1,2,...} := Nq, and let (pk{t) = 
Pk{t) be a family of classical orthogonal polynomials. Then 




f{t)Pk{t)g{t)dt, 



fit) :='^hi^^F{k)pk{t) 
k=0 



( 11 ) 



is the pair of generalized Fourier transforms, where k G Nq, t G [a, 6], 
g{t)dt = dqi{t) and dfi{k) = are measures on the signal and spectral 
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domains respectively. We consider special cases of classical orthogonal 
polynomials. Case 1. Let Ul = [— 1,+1], g{t) = (1 — t)“ (1 + t)^, 
a > P — 1 and Jac^^'^\f) he {a, P)- Jacobi polynomials. In this case, 
generalized Fourier transforms for each a and P are the Fourier- Jacobi 
transforms 

+1 

i^,/3)F{k) = = J /(t) - t)“(l + tfdt, (12) 

-1 



f{t) := ^‘^’^'>CJ^-\F}{k)='^hf^ ^^’f^^F{k)Jac^j^’^\t) (13) 

k=0 

for special constants h^. If a> P > —^ then the multiplication formula 
for {a, P)- Jacobi polynomials is 



Jac^^’^\r)Jact^\t) = Pl^’^\tmr) 



1 7T 




0 0 



i(l + r)(l + t)+ 



-(1 — r)(l — t)s^ + Y^(l — t2)(1 — P)scos 6—1 



dfi{s,6), (14) 



where dp.{s,6) = ~ ^ ^s^^+^(sin 0)^^cis(i0. There 

follows 



1 7T 



(T,V)(t) = /(tfflr) = J J f 

0 0 



-(l + r)(l+t) + 



+ -(1 — r)(l — t)s^ + Y^(l — t 2)(1 — t^)scos 6 — 1 



dfj,{s,6). (15) 



Case 2. If a = P = 0 then {Jac^k ’^\t)}^^Q = {Legk{t)}^^Q is the 
Legendre basis. From (15) we obtain the Legendre CSOs 

1 

{Tlf){t) = fit^T) = ^ J f(rt + ^/{l- r2)(l - P)s"j (1 - s^y^/^ds, 

(16) 

associted with the Legendre transform. Case 3. If a = P = —0.5, then 
{Ja4 ^ = {C^fc(^)}^o Legendre basis. In this case, 

H = (—1, 1), g{t) = (1 — /lo = f , /in = 7T, n G No = Li* . For the 

Chebyshev polynomials the following multiplication formula is known: 



Chn{T)Chn{t) = Chn{t ffl t) = 
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1 r 

2 L 

Therefore, 



Chn { Tt + V^(l - r2)(l - + Chn \^Tt - V^(l - 

(r;/)(t) = /(tfflr) = 



1 



f (tT+ V^(l -t 2)(1 -t2)) + / [ tT _ V^(l - t2)(1 - 



(17) 



(18) 

□ 



Example 4 Finally, we eonsider the infinite interval H = (— oo,+oo). 
Let us introduee the signal and the speetrum spaees 





+ 00 




\f{t)\‘^w{t)dt < oo 



L 2 (N,C,^„) = I F{n) (^F{n) : N — > C^t(^'^Wn\F{n)f < oo^ I , 



nSN 



with the sealar produets 
+ 00 



(/,5):= j f{t)g{t)e-^"/^ dt, (F, G) = — 4^F(n)G(n), 

_-G nSN 



where Ll* = N = {0,1,2,...,}, dp,{t) = w{t)dt, w{t) = e and 
Wn = l/2"n!-y/F. In this ease, the generalized elassieal Fourier transform 

of a signal f{t) G L 2 ("k, C, is the Fourier-Hermite transform 



F{n)=CHf}{n) 




f{t)Hern{t)e dt, 



where 

00 - 

f{t) = CT-\F]{n) = Y, 

n=0 ' 

where HeVnit) are Hermite polynomials. Sinee 



Herk{t)Herk{T) = Herk{t ffl r) 



(-l)^r(A: + (3/2))2^+i 

7^ 



Hevk 



{t^ + + 2tr cos exp(— tr sin tp) sin (/?Jo(tr sin tp)dg:> 

(19) 
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then 

T[f{t) = f{tmr) = 

f _|_ 7-2 _|_ 2tT cos (p'j sirup jQ{tTsmip)d(f, ( 20 ) 

where Jo(-) is the Bessel funetion. □ 

1.3 Generalized convolutions and correlations 

It is well known that stationary linear dynamic systems (LDS) are de- 
scribed by convolution integrals. Using the GSO notion, we can formally 
generalize the notions of convolution and correlation [11]-[18]. 

Definition 2 The following funetions 



y{x) := (/iO/)(x) 


= J h{v)f{xBv)dp{v), 

vefi 


(21) 


Y{w) := {H^F){io) 


= J H{u)F{uJ Q u)dfi{u) 


(22) 






are ealled the <0 and -eonvolutions respeetively. 

The spaces Sigg and Spg equipped with multiplications <0 and U* form 
commutative signal and spectral convolution algebras ((SigQ,<C>)) and 
((Spg,^)), respectively. 

Definition 3 The expressions 



if^g){v) 


■= / f{x)g{xBv)dp{x), 


(23) 




J 




{F4ltG)in) 


:= J F{uj)G{uj © v)dp{uj) 


(24) 



ijj&n* 



are referred to as the eross Jl» and 4ft -eorrelation funetions of signals f,g 
and speetra F, G, respeetively. If f = g and F = G, then the erosseorre- 
lation funetions are ealled the and 4ft-autoeorrelation funetions. 

The measures indicating the similarity between /T-distributions and 
^/-distributions and their time and frequency-shifted versions are their 
crosscorrelation functions. 

Definition 4 The expressions 

J fF{x,uj)gG{xSv,ujQn)dfi{x)dfi{u}), (25) 



ifFJI,4l^gG){v,n) := J 
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{F f^^Gg){u,v) \= j j F f{uj,t)Gg{ijjQv,tUv)dg,{x)dg,{ijj). (26) 

are referred to as the and -erosseorrelation funetions of the dis- 
tributions respeetively . If fF{x,uo) = gG{x,to) and Ff{uj,t) = Gg{uj,t), 
then the erosseorrelation funetions are ealled the autoeorrelation fune- 
tions. 

Theorem 2 Generalized elassieal Fourier transforms (1) and (2) map 
0 and ’Q-eonvolutions and Jft and 4f»-eorrelations into the produets of 
speetra and signals, respeetively, 

CF{h<>f} = CF{h}CF{f)} , CF~^ {H^F} = CT~^ {F} 

CT =CF{f} W{^, CF~^ {F4G} = CF~^ {F} CF~^ {G} 

Taking special forms of the GSOs, one can obtain known types of con- 
volutions and crosscorrelations: arithmetic, cyclic, dyadic, m-adic, etc. 
Signal and spectral algebras have many of the properties associated with 
classical group convolution algebras. Many of them are catalogued in 
[9]-[19]. 

1.4 Generalized ambiguity functions and Wigner 
distributions 

The Wigner distribution was introduced in 1932 by E. Wigner [20] 
in the context of quantum mechanics. There he defined the probability 
distribution function of simultaneous values of the spatial coordinates 
and impulses. Wigner’s idea was introduced in signal analysis in 1948 
by J. ViLLE [21], but it did not receive much attention there until 1953 
when P. Woodward [22] reformulated it in the context of radar theory. 
Woodward proposed treating the question of radar signal ambiguity as 
part of the question of target resolution. For that, he introduced a 
function that described the correlation between a radar signal and its 
Doppler-shifted and time-translated version: 

+ 00 

AW“[/](i^,u) = f /(x)/(x - = (jjr{//“(x,u)}, 

J X^V 

—oo 

where //“(x, u) := f{x)f{x — v). The distribution AW“[/](i/, x) is called 
the asymmetrie Woodward ambiguity funetion. It describes the local 
ambiguity of locating targets in range (time delay v) and in velocity 
(Doppler frequency n) . Its absolute value is called the uneertainty fune- 
tion since it is related to the uneertainty prineiple of radar signals. 
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The next time-frequency distribution is the so-called symmetric Wood- 
ward ambiguity function: 

AW^[f]{n,v) := {/ + |) / “ |) } = (27) 

where ff^{x,v) := / (x -|- |) / (x — . Analogously, we have expres- 

sions for computing AW^[T](x,j^) in the frequency domain 

AW<^[F]{n,v) = c^-i{F(cu)F((u - n)} = cjr-i{FF<^{n,w)}, 

AW^[F]{n,v) = cf-^{F L + ’^) F L - ’^)} = cjr-i{FF%n,co)}. 

If F = CF{/}, then from Parseval’s relation we obtain 

AW“[/](jy,x) = AW“[F](z/,x) and AW"[/](z/, x) = AW"[F](zy, x). 

For this reason, we shall denote AW“[/](z^, x), AW“[F](z/, x) by AW“(z/, x) 
and AW^[/](z^, x), AW^[F](z^, x) by AW^{n,v). Further, we use the sym- 
bol AW{n,v) for both AW“(z/, x) and AW^(i^, x). 

Important examples of time-frequency distributions are the so-called 
asymmetrical and symmetrical Wigner-Ville distributions. They can 
be defined as the 2D symplectic Fourier transform of AW“[/](z^, x) and 
AW^ [/] (z/, x) , respectively, 

WV“[/](x,cx) =c^-iC^{AW“[/](z.,x)} = /(x)FMe-^-^ (28) 

U)^V 

oo 

WV^[/](x,cx) =(jjr-i(-jr{AW^[/](z^,x)}= [ f (x+^) f (x-^)e~^^^dv. 

— OO 

(29) 

The 2D symplectic Fourier transform in (29) and (28) can be also viewed 
as two sequentially performed ID transforms with respect to x and v. The 
transform with respect to v yields the temporal autocorrelation functions 

fF(x,x) =cjr-i{AW^[/](z/,x)} = /(x)/(x-x), 

ff*(x,x) =CF“UAW"[/](z/,x)} = / (® + I) / (^ - I) • 

The transform with respect to n yields the frequency autocorrelation 
functions 



FF“(z/,(x) = (jjr{AW"[F](zy,x)}=F((x)F(w-z^), 
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ig{x,v) 




FG(i/,w) 



Figure 1. Diagram of relations between the different generalized 2D distribntions 



FF^(^z,o;) = cAAW^[F]{u,v)} = F (a; + 0 F (o; - 0 . 

We can formally generalize the notions of cross-ambiguity functions and 
Wigner-Ville distributions using the GSO notion. 

Definition 5 The symmetric and asymmetric generalized Woodward 
distributions (cross-ambiguity functions) of two signals f,g and two spec- 
tra F, G are defined by 

^W^[f,g]{j^,v) = CF{fs‘'] = J Tn{x)dfi{x), 

AW^[F,G\{u,v)=c^^-j{FGj= I [F(w®’QG(we’Q]TUv)dKu,), 
AW^[f,g]{i^,v) = CFlfg''} = f f{x)g{xBv) fiu{x)dn{x), 

xGQ 

AW^[F,G]{u,v) = cf-^{fG^}= [ \F{u;)G{u;Qu)\g,Uv)dfi{u;). 
r)<— oj ^ ^ J ^ 

ojefi* 

Definition 6 The generalized symmetric and asymmetric Wigner-Ville 
distributions of two signals f, g and two spectra F, G are defined by 

WW[f,g]{x,uj) = CF { fg"} = j Toj{v)dg{v), 
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Wr[F,G]{x,u;) =cjr-i{FG^}= / 

X-t—U J 






F \ iO ^ — \G (u^©“r 



2J\ 



cpu{x)d^{i'), 



WV[f,g]{x,(jj) := CF = f{x)F{uj)ip^{x), 

Wr[F,G]{x,u;) := cf-HFG-} = H^)fix)TUx). 

Figure 1 is a flowchart relating 

AW[f,g]{u,v), AW[F,G]{u,v) 

and 

W\[f,g]{x,u;), W\[F,G]{x,u;) 

and 

fg{x,v), FG{iy,uj). 

We can construct two vector Hilbert spaces of “time-frequency” and 
“frequency-time” distributions 

WV := {WV(x,w) I WV(x,u;) : Ul x Ul* — > A}, 



AW := {AW{i^,v) I AW{iy,v) : x H — > A}. 

Definition 7 The generalized Woodward-Gabor ambiguity transforms 
(or short-time and short- frequeney generalized Fourier transforms) WGg 
and WGg assoeiated with funetions g and G are defined as the following 
mappings: 

WGg: L(H,A) — >L(H*xH,A), WGg- — >L(H*xH,A) 

given by 

■= AW[f,g]{iz,v), WgG{F}{n,v) := AW[F,G]{v,iy). 

Definition 8 The generalized Wigner-Ville transforms WVg andWVG 
assoeiated with funetions g and G are defined as mappings 

WVg : L(D,A) — >L(DxD*,A), WVg ■ L(D*,A) — >L(DxD*,A) 

given by 

WVg{f}{x,u;) := WF[/, cu), WVg{F}{x,u;) := WV[F,G]{x,w). 
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2. Generalized quantum signal/system theory 
on hypergroups 

2.1 Basic definitions 

The basic objects of quantum harmonic analysis (QHA) are related 
not to classical signals and spectra /, FJaut to quantum signals and 
quantum spectra (Hermitian operators) /, F associated with classical 
signals and spectra as follows: 

/ ^ AW[/] ^ /, AW[F] ^ F. 

These maps are called the Weyl quantizations of signals and spectra, 
respectively. There are also the Schwinger quantizations using Wigner- 
Ville distributions: 

/ ^ WV[/] ^ /, F^ WV[F] ^ F. 

The functions AW[f]{u,v), AW[F]{v,v) (or WY[f]{x,uj), WY[F]{x,co)) 
are called the symbols (a symbol is not a kernel) of the quantum signal 
/ and the quantum spectra F, respectively, and are denoted by 

AW[/](jv,u) := sym{f}, AW[F]{v,u) := sym{F}, 

or 

WY[f]{x,uj) := sym{f}, WV[F](w,x) := sym{F}. 

Vice versa, a quantum signal / and quantum spectra F are called the 
operators associated with a classical signal f and classical spectrum by 
symbols AW[/], AW[F] (or by WV[/], WV[F]), respectively, and they 
are denoted by 

/ := Op{AW[f]}, F := Op{AW[F]}, 
or 

/ := Op{WY[f]}, F := Op{WY[F]}. 

All quantum signals / and quantum spectra F form the following quan- 
tum spaces: 

Sig;^ :={/!/ are operators acting in L2(0, A)}, 

Spi := {F I F are operators acting in L2(0*, A)}. 

Let Sig]^ and Spi be the spaces of quantum signals and quantum 
spectra with the following scalar products and norms: 

(/1I/2) := Tr(hfl), imi := {f\f) = Tr(//^), 
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(Fi|f 1) := Tr(Fii?t), ||F|| := {F\F) = Tr(FF^), 
where Tr(.) denotes the trace. 

Definition 9 The spaces Sigj^, Spj^ with the scalar products (/1I/2), 
(F 1 IF 2 ) and norms \ \f\\, ||F|| are called the Hilbert-Liouville spaces. 

Let {^aIasA s-iid {V’aIasA be two A-parametric families of operators, 
parametrized by the label A = (Ai, A 2 , • • • , A^) G A C R'" of a subset A of 
an rD space endowed with a suitable measure p{X). These families 
are called the quora for any subalgebras Algi C Sig^ and Alg^ C Sp^, 
if every quantum signal / G Algi and quantum spectrum F G Alg^ is 
determined by all scalar products 

FT(A) = (/I^a) = Tr{flp{), TF(A) = (F|^a) =Tr(F^l ) 



for all (px and i/’A- The fundamental property of the quora is that any 
quantum signal and spectrum can be expressed as integral transforms 



/ = Q^-1{FT(A)} = 


[ FT{X)pxdp{X) = Op{FT(A)}, 


(30) 


AeA 




F = Q.F-HtF(A)} = 


[ TF{X)ipxdpW = Op{TF(A)}, 


(31) 




AeA 




where 

FT(A) = QT{f} 


= (.^^a) = Tr(/^^) = sym{f}, 


(32) 


TF(A) = QHf} 


■= (/IV^) = Tt(/^a) = sym{F}. 


(33) 



Usually, FT(A) and TF(A) are Wigner-Ville distribution and Woodward 
ambiguity functions, respectively. Further we shall use only Woodward 
ambiguity functions to design quantum signals and spectra. 



Definition 10 Let {^a}asA and {V^aIasA two r-parametric families 
of operators. Then transforms (30)-(33) are called the abstract quantum 
Fourier transforms for the algebras Algi C Sigi and Algi* C Spi 
associated with two quora {^a}asA and {V'aIasA, respectively. 



2.2 Classical Weyl quantization 

It is well known that for the classical shift we have 
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I 



OO 



E 



{iv)^ 

kl 




OO 



E 



kl 



/(x) = e-®^/(x), (34) 



where T>x = This expression represents the decomposition of the 

ordinary finite shift into a series of powers of the differential opera- 
tor ^ and is called the infinitesimal representation of translation shift. 

Analogously, we can obtain D^F{uj) = F{uj + u) = where 

= —i-£^- Hence, In 1932 H. Weyl proposed 

[23] to modify the Fourier transform formula by changing its eomplex- 
valued harmonies into operator-valued harmonies. He used the following 
three quora for his quantization procedures of the signal space Sigg : 



{4 



^,v] — giCMx-eVDj;) 



}. {4- 



v) _ ^inMx 



}■ {4 



r,v) _ fi,vT>x ^nMx 



associated with the classical Fourier transform, where multiplication 
and differential operators are given by 



Mxf{x) 



xf{x), T>xf{x) := 



dx 



Using the first quorum, H. Weyl wrote any quantum signal / G Sig^ as 

/:= QJ^x{AW[f]] = 



Op{aw[/]} 



AW[f]{n, dfi{n)dn{v) , (35) 



ly&U* v&Q 



where 



AW[f]{n,v) = QF-^{AW[f]} = Sym{f} = Tr\f ^ 

'(36) 

Transformations (35) and (36) are called the direct and inverse ordinary 
quantum Fourier transform^m^ihe quantum signal space. It is natural 
to view maps AW[/j — > /, / — > AW[/j as operator-valued Fourier 
transforms. But we can write them in the explicit form of the integral 
kernels. For example, for the map AW[/j — > / the kernel f{x, y) of the 
operator / has the form: 



f{x,y)= j AW[/](jv,y 
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Of course, for quantization of the spectral space Spg one can use three 
dual quora 



iv,u] _ ^i[vM^+ut>A 



_ ^ivMu 



where := u>F{uj), V^^Fiijj) := Using the first quorum, 

we can write any quantum spectrum F G Spj^ as follows: 



F := Q^^{aW[F]} = 



J AW[F]{v,u)F^^^-+''^-Ufi{v)dn{u), (37) 

where 

AW[F]{v,u) = QF~^{AW[F]} = Sym{F} = Tr [f , 

(38) 

Transformations (37) and (38) are called the direct and inverse ordinary 
quantum Fourier transforms in the quantum spectral space. 

2.3 Generalized Heisenberg— Weyl operators 

Let us construct generalized operator-valued hyperharmonics associ- 
ated with an orthogonal basis {<^Lj(x)}tjgQ* . 

Definition 11 Operator T>x, for whieh T>x(puj{x) = uJipoj{x) is valid, is 
ealled the generalized differential operator. 

The generalized differential operator appears as an ordinary differen- 
tial operator with variable coefficients, for example Fx = P2{x)-^ + 
Pi{x)-^ +Pq{x), where P2{x), Pi{x), po{x) are some variable coefficients. 
Let us now find a connection between the GSOs Tf and the generalized 
differential operator Fx- It can be found using the Taylor expansion. 

Theorem 3 Let G Sigg he some Fourier basis, eonsisting 

of A- valued basis funetions. Then all GSO^assoeiatfA with it have the 
infinitesimal representation: Tf = (pjy (u), Dfj = (p^ifFA)-, and are ealled 
the operator-valued hyperharmonies assoeiated with an orthogonal basis 
where FxPuj{x) = wp>^{x), F^(p^{x) = xv?^(x). 

Proof: If the signals Tu){v) are decomposed into the following series 
of uj, ^k{v){uj)^ , then we can construct the operators 
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OO 

(u) = Xk{v)T>^. For these operators we have 

fc=o 




Toj{x) 






\k=0 




{v){u;)'^ ip^{x) = 



Tl^(v)cp^(x) = ip^{x a v) = f^ipuj{x), ( 39 ) 

i.e., Analogously, = (fu{^Lo)- Obviously, = (pu(Mx) 

and M;^ = (pjQ^(v). □ 

Using the hyperharmonics = (pf, (v) and = (fyiVY) associated 
with the basis i we can construct generalized Heisenberg- 

Weyl operators and quantum hyperharmonic analysis of quantum signals 
and spectra. 

The “time” -shift and “frequency” -shift operators together acting on 
spaces Sigo and SPo induce the following pair of sets of the Heisenherg- 
Weyl operators: 

HW, := | G O*, r; G o} , 

HW^ := I 1/ G O*, G O} . 

They act on Sigg and Spg by the following rules: 

:= (m^T;/) (x) = Mx)f{xSv), 



:= (m^D^^f) (u;) = pUv)F{co © u). 



Obviously, 



and 



{©’■>/(i)}= 45„W©-'>FH 



^0 

a;— 



■ {£<”'-'F(c)} = v>.(t>)£©’/(i)- 



Now we construct two sets of symmetric Heisenberg- Weyl operators: 






|i£ G G H 

These operators satisfy the following composition laws: 



G G h|, 

} 
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c^v,u]^v',u'] 

‘-'UI ‘-'UJ 




and the “commutation” relations 







2.4 Generalized Weyl quantizations 



Let us consider the linear quantum spaces Sig^ and Spj^ of quantum 
signals / and quantum spectra F, respectively. The inner product can 
be defined by := Tr(/i^), (AIT 2 ) := Tr(FiF|). It is easy to 

check that 




5{vHv')5{vHv'), 



5{v © v')5{v 0 v'). 



(40) 

(41) 



The families | I and | I form two quora in 

^ ^ [L'ltjjGn* x£7 I J [ii,L']G£7xr2* 

quantum spaces. For this reason, any quantum signal / G Sigj^ and 
quantum spectra F G Spi can be written as follows: 



/ = Q.F,{aW[/]} = Op{aW[/]}= J J AW[f]{l^,v)it^U^i{,y)d^i{v), 

(42) 



F = QF^{aW[F]'} (43) 

= Op{aw[/]} 

= 11 AW[F]{v,i^)s!^^^Uf,{v)df,{u). 



Using (40) and (41), one can invert (42) and (43) as follows: 



AW[f]{i^,v) 



QF~^{AW[f]} = Sym{f} = Tr 




(44) 



AW[F]{v,v) 



QF~^{AW[F]} = Sym{F} = Tr 




(45) 



The transformations (42) and (45) are called the generalized quantum 
Fourier transforms. 
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Example 5 In this example we eonsider the Weyl quantization on a 
finite eyelie group It = It* = 'Ljp, where p is a prime integer. In this 
case 

= 





‘ 1 




V 


‘ 0 


1 






el 








0 


1 


uv 

e 2 


e 2 










0 1 






eP-i 




1 




0 



For this reason, the map 

f = Q^,,[AW[f]} = Op{AW[f]} = E E = 

/pt>ez /p 





■ 1 


V 


'01 




el 




0 1 




e2 






n£li Ipv^TLjp 


eP-i 




0 1 

1 0 








(4 



is the diserete quantum Fourier transform assoeiated with the eyelie 
group Z. 



2.5 Generalized quantum convolutions 

For the product of two quantum signals / and g we have 

/?= y j AW[f]{n,v)i^fi''’^AW[g]{n',v')^/’^'Ufi{n,v)dp.{n',dv') = 

{u,v) {n',v') 

j (aW[/] ® AWb]) {w,x)if^d^{w)dx = 

(uj,x) 

{AW[f] ® AW[ff]} = Op{AW[/] ® AWb]} , 

where the expression 

(^AW[/] ® AW[g]J (w, x) = Ql^f^ifg} = sym{fg} = 

J FT(n,v)GT(LO Q iy,xBv)(pl{^{v)(py'^{v')dfj-{n)dv (47) 

{u,v) 




98 COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 

is called the generalized twisted signal eonvolution. Analogously, 

FG = J j d^i{v')dv' = 

(v^v){v' ,v') 

j (aW[F]^AW[G]) (x, iu)£}^'‘^kx dnito) = 

(x,u) 

QF^ {AW[F]^AW[G]} = Op{AW[F]^AW[G]} , 

where 

(AW[F]^AW[G]) {x,io) := QF^^FG} = sym[FG^ = 

J AW[F]{v,u)AW[G]{xSv,iv Qu)ifl,G{v)iplG(^y'^(lv dy{u) (48) 

(v,u) 

is called the generalized twisted speetral eonvolution. 

According to the Pontryagin duality principle we can define the gen- 
eralized quantum eonvolution of quantum signals by 

f®g:= Op{AW[/]AW[ff]} = Q^i^){AW[/]AWb]}, 

where 

AW[f]{n,v)AW[g][n,v] = 

sym{f®g} = QF~^{f®g} = Tr (J®dj > 

and the generalized quantum eonvolution of quantum spectra by 

Fi^G := Op{aW[F]AW[G]} = Q^<^|aW[F]AW[G]}, 

where 

AW[F]{iz,v)AW[G][n,v] = 

sym {f^g} = QF~^ ^ [(^'^^) • 

Theorem 4 The quantum generalized eonvolutions and quantum gen- 
eralized Fourier transforms are related by the expressions: 

QF^ {AW[f] ® AW[g]} = fg, QF^ {AW[F]irAW[G]} = FG, 

and 

{/® ?} = AW[f]{n,v)AW[g][i',v], 

QFf} = AW[F]{v,v)AW[G][v,v]. 
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3. Conclusion 

In this paper we have examined the idea of generalized shift operators 
associated with an arbitrary orthogonal transform and generalized linear 
and nonlinear convolutions based on these generalized shift operators. 
Such operators allow one to unify and generalize the majority of known 
methods and tools of signal processing based on the classical Fourier 
transform for generalized classical and quantum signal theories. 
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Abstract 

In this lecture the group of rigid body motions is introduced via its 
representation on standard three dimensional Enclidian space. The rel- 
evance for robotics is that the links of a robot are usually modelled as 
rigid bodies. Moreover the payload of a robot is also usually a rigid 
body and hence much of robotics is concerned with understanding rigid 
transformations and sequences of these transformations. Chasles’s theo- 
rem is presented, that is: a general rigid body motion is a screw motion, 
a rotation about a line in space followed by a translation along the line. 

The lower Renleaux pairs are introduced. These are essentially sur- 
faces which are invariant under some subgroup of rigid body motions. 
Such a surface can be the matting surface for a mechanical joint. In 
this way the basic mechanical joints used in robots can be classified. 
These surfaces turn out to have other applications in robotics. In robot 
gripping they are exactly the surfaces that cannot be immobilised using 
frictionless fingers. In robot vision the symmetries of these surfaces are 
motions which cannot be detected. 

Next Lie algebras are introduced. The 1-parameter rigid motions 
about a joint are considered. The correspondence between elements of 
the Lie algebra and 1-degree-of-freedom joints is given. The exponential 
map from the Lie algebra to the group is dehned and used to describe the 
forward kinematics of a serial robot using the product of exponentials 
formula. The Rodrigues formula for the exponential map is derived 
using a set of mutually annihilating idempotents. 

The derivative of the exponential map is explored. In particular the 
Jacobian matrix for the forward kinematics is derived. More general 
results are also derived and these are used to show how the inverse 
kinematics problem can be cast as a set of ordinary differential equa- 
tions. 
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Keywords: Clifford algebra, Lie algebras, Lie groups, rigid body motions, robotics. 



1. Introduction — Rigid Body Motions 



A large amount of robotics is concerned with moving rigid bodies 
around in space. A robot here is usually an industrial robot arm as in 
figure 1, however most of the material presented here applies to par- 
allel manipulators and closed loop mechanisms. There are also some 
applications to mobile robots and robot vision. 

It is usual to think of the components that make up the links of the 
robot as rigid and also the payload or tool is often a rigid body. Hence, 
an important problem in robotics is to keep track of these rigid bodies. 

It is well known that any rigid transformation can be composed from 
a rotation, a translation and a reflection. Physical machines cannot per- 
form reflections and so we should really speak of proper rigid transfor- 
mations, excluding the reflections. The proper rigid transformations can 
be represented using 4x4 matrices. These can be written in partitioned 
form as 



A = 



R t 
0 1 



where ii is a 3 x 3 rotation matrix and t is a translation vector. 
The action of these matrices on points in space is given by. 






-b 



where p and p^ are the original and transformed position vectors of a 
point. 

There are two ways we can use these matrices to describe the position 
and orientation of a rigid body. If we use an active point of view we can 
agree on a standard ‘home’ position for the body, any subsequent posi- 
tion and orientation of the body is described by the rigid transformation 
which moves the home position of the body to its current position and 
orientation. 

In the passive viewpoint a coordinate frame is fixed in the body, now 
the position and orientation of the body is given by the coordinate trans- 
form which expresses the coordinates of points in the current frame in 
terms of those in the home frame. For historical reasons, this passive 
viewpoint seems to be preferred in robotics even though the active view- 
point is often simpler. 

These two viewpoints are related quite simply. The transformation 
given by the passive view is the inverse of the active transformation. To 
see this assume that the new frame has its origin at a point with position 
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Figure 1. A six-joint industrial robot arm. 




Figure A vector referred to two coordinate frames. 
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vector t with respect to the original frame. Now an arbitrary point p 
in the original frame has position vector p' = p — t with respect to the 
new frame, see figure 2. In general the orientation of the new frame will 
be different to that of the original, assume it is given by a rotation R. 
That is, the basis vectors of the new frame i^, and k', are given by, 

i' = M, j' = Rj, k' = Rk 

If we write the position vector of the point as 

p = xi + yj + zk 

in the original frame, then in the new frame the point will have coordi- 
nates, 

p = XI +y j + zk 

where = p' • = (p — and so forth. Hence the new coordinates 

can be written in terms of the old as 

= 0 - 

Compare this with the inverse of a general rigid transformation which is 
given by. 

The space of all possible rigid body transformations is an example of 
a Lie group. So we are led naturally to study Lie groups. 

2. Lie Groups 

In general, groups can be thought of as sets of symmetry operations. 
Originally these were symmetries of algebraic equations. We will be 
interested in symmetries of geometric figures here. 

The definition of a group is the abstraction of the properties of sym- 
metry operations. There are four axioms. So formally a group is a set 
G with a binary operation, usually written as juxtaposition. 

■ The group must be closed under its binary operation. 

■ The operation must be associative. 

■ The group must have a unique identity element. 

■ Every element of the group must have a unique inverse. 
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Notice that sets of square matrices can be groups quite easily, the 
group product is modelled by matrix multiplication, which is associa- 
tive of course. The identity element of the group is represented by the 
identity matrix. The inverse of a group element is represented by the 
inverse matrix, so it is important that the matrices in the set are all 
non-singular. 

Lie groups have to satisfy the following additional axioms. 

■ The set of group elements G must form a differentiable manifold. 

■ The group operation must be a differentiable map. As a map the 
group operation sends G x G — > G. On pairs of group elements 

£ G the map has the effect {gi, g 2 ) ^ gig 2 - 

■ The map from a group element to its inverse must be a differen- 
tiable mapping. On an element g ^ G this map has the effect, 

g ^ g~^- 

A differentiable manifold is essentially a space on which we can do 
calculus. Locally it looks like but globally it could be quite different. 
It is usual to think of such a manifold as patched together from pieces 
of The standard example is a sphere, which can be constructed 
from two patches, one covering the northern hemisphere and the other 
the southern hemisphere. The patches can also be thought of as local 
coordinate systems for the manifold, in general manifolds do not have 
global coordinate systems. 

The set of non-singular n x n matrices is an example of such a Lie 

group. The underlying manifold of the group is simply an open set in 

2 2 

, that is with the closed set of determinant zero matrices deleted. 
The other axioms are clearly satisfied, matrix multiplication is clearly 
differentiable and so is the inversion map. This group is usually denoted 
GL{n) for the general linear group of order n. 

Subspaces of this group can also be groups. A good example for us is 
the rotation matrices. These are 3x3 matrices which also satisfy, 

R = I, and det(i?) = 1. 

These equations define a non-singular algebraic variety in GL(3). It 
is possible to show that such a subspace is a differentiable manifold. 
In this case the group manifold is 3-dimensional projective space PM^. 
This space can be thought of as the space of lines through the origin in 
or alternatively as the 3-dimensional sphere with anti-podal points 
identified. The group is usually denoted 50(3), the special orthogonal 
group of order three. 
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Next we look in detail at the Lie group that we are most concerned 
with; the group of rigid body transformations. 



3 . 



Finite Screw Motions 



In robotics the group of rigid transformations is called SE(3), it is 
supposed to denote the special Euclidian group. It is the semi-direct 
product of 50(3), the rotations about the origin, with M^, the transla- 
tions. 

SE{3) = 50(3) xM^ 

In a direct product the factors would not interact. The semi-direct 
product indicates that the rotations act on the translations. This can 
be easily seen from the 4x4 matrices, if we multiply two such matrices 
we get, 

E 2 ^ 2 ^ f Rl _ f R 2 R 1 R2'^1 + ^ 2 ^ 

0 lyvo v~vo 1 

Notice how the first translation vector is rotated before it is added to 
the second one. 

The group manifold for a semi-direct product of groups is simply the 
Cartesian product of the manifolds of the factors. So the group manifold 
for SE{3) is simply the Cartesian product of with M^. Notice that 
this is therefore also the configuration manifold for a rigid body, each 
position and orientation of the body corresponds to a group element and 
vice versa. 

We end this section with a sketch of Chasles’s theorem. This, origi- 
nally geometric, theorem dates back to the 1830s, well before Lie’s work 
on ‘continuous groups’. The theorem states that every rigid transforma- 
tion (with the exception of pure translations) is a finite screw motion. 
That is, a rotation about a line together with a translation along the 
line, see figure 3. 

A finite screw motion about a line through the origin has the form 

where x is a unit vector along the axis of the rotation, 0 the angle of 
rotation, and p the pitch of the motion. Since the axis of rotation is x 
we also expect that Rx. = x, in other words x is an eigenvector of R 
with unit eigenvalue. 

In general, if the line doesn’t pass through the origin the transforma- 
tion can be found by conjugation. Suppose u is a point on the line, then 
we can translate u back to the origin, perform the screw motion above 
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Figure 3. A finite screw motion. 



and finally translate the origin back to u. This gives 

(o l) (o l) ^ 



Now Chasles’s theorem amounts to the following: Given an arbitrary 
rigid transformation it can always be put in the above form. That is, 
we have to solve, 




R 

0 






for p and u given t and R. Assuming we can find 9 and x from R then 
it is not too difficult to see that 



6p 



= X • t. 



This gives the pitch p. Now we have a system of linear equations for u. 



(J-i?)u= — x-t. 
^ ^ 27T 



These equations are singular, but the kernel of {I — R) is clearly x and so 
the equations are consistent and it will be possible to find a solution for 
u up to addition of an arbitrary multiple of x. In practice, it would be 
sensible to require that u be perpendicular to x. If the transformation 
is a pure translation then R = I and the above fails, this is the only 
case where u cannot be found. Pure rotations correspond to pitch zero, 
p = 0. Screw motions and pure translations are often taken to be screw 
motions with infinite pitch. 
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Notice that if we have two screw motions about the same line with the 
same pitch then these transformations commute. It is only necessary to 
check this for lines through the origin, the conjugation can be used to 
extend this easily to the general case. So for example we have 



^R{e2) ^x 



fR(01 + 02) 
V 0 



(dl+02)p 

2tt 

1 



X 



Remember, the rotations here are about the same axis. From this it 
can be seen that the set of all screw motions about the same line with 
the same pitch constitutes a group. That is, the set is closed under the 
group operation. These are the one-parameter subgroups of SE{3). 



4. Mechanical Joints 

At the end of the nineteenth century Franz Reuleaux described what 
he called “lower pairs” [7] . These were pairs of surfaces which can move 
relative to each other while remaining in surface contact. He took these 
to be idealisations for the most basic of mechanical joints. He found six 
possibilities, see figure 4. 

It is possible to give a simple group theoretic proof that these are 
the only possibilities. The key observation is that these surfaces must 
be invariant under some subgroup of SE{3). The subgroup represents 
the symmetries of the surface. For a joint, the subgroup will give the 
possible relative motions between the sides of the joint. 

To find these surfaces, consider the surfaces invariant under one- 
parameter (1-dimensional) subgroups. 

■ The pitch zero subgroups correspond to rotations about a line. A 
surface invariant under such a subgroup is just a surface of rotation. 

■ Infinite pitch subgroups correspond to translations in a fixed di- 
rection, so any surface of translation is invariant under such a 
subgroup. 

■ The subgroups with finite, non-zero pitch have helicoidal surfaces 
as invariants. These give the first three lower Reuleaux pairs. 

To find more Reuleaux pairs consider how the subgroups can be com- 
bined to form larger subgroups. Surfaces invariant with respect to these 
larger subgroups must have more than one of the above properties. 

■ So the cylinder is a surface of rotation and a surface of translation 
(and a helicoidal surface for any pitch). 
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Revolution 



Suifaceof 

Trandation 



Helicoidal 

Sncfoce 



Figure 4- 



The six lower Reuleaux pairs. 



■ A sphere is invariant with respect to the subgroup of rotations in 
space SO{3). That is, the sphere is a surface of rotation about any 
of its diameters. 

■ A plane with respect to SE(2) the group of rigid transformations 
in the plane. 

With a little care, it is possible to show that these are the only 
possibilities — Reuleaux found them all! 

In mechanical engineering, the joints corresponding to lower Reuleaux 
pairs have special names. Simple hinge joints corresponding to the ro- 
tational pair are known as revolute joints. Sliding joints corresponding 
to the translational pair are called prismatic joints and the joints cor- 
responding to the helicoidal pair are helical joints or sometimes screw 
joints. A ball-and-socket joint is more properly called a spherical joint 
and the last two pairs form cylindric and planar joints. 

Practical robots may have any of these joints, but it is most common 
to actuate revolute or prismatic joints. Revolute joints can be accu- 
rately and easily controlled with electric motors. When larger forces 
are required hydraulic rams can be used as actuated prismatic joints. 




no 
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However, for a hydraulic ram the length of travel is hard to control 
accurately. 

It is possible to find all the subgroups of SE(3). The zero-dimensional 
subgroups are the point groups, which are subgroups of the rotations 
only, and the 230 space groups familiar from crystallography. For the 
higher dimensional subgroups see Herve [4]. 

5. Invisible Motion and Gripping 

The lower Reuleaux pairs turn up in other places in robotics. 

Consider trying to constrain an object by placing frictionless fingers 
on the object’s surface. The fingers can only exert forces normal to the 
surface of the object. It is known that there are some surfaces which 
cannot be completely constrained by any number of frictionless fingers. 
These surfaces are precisely the lower Reuleaux pairs. 

It is straightforward to see that a lower pair is “ungrippable,” the 
symmetry will move the surface such that the velocity at any point on 
the surface is perpendicular to the contact normal. Hence, these motions 
cannot be constrained. 

It is a little harder to show that any ‘ungrippable’ surface must have a 
symmetry. Essentially, the space of normal lines to the surface are con- 
sidered as six-dimensional force-torque vectors called wrenches. For an 
‘ungrippable’ surface the normal do not span the six-dimension vector 
space of wrenches. Hence, there will be a dual six-dimensional vector 
which annihilates all normal lines, this vector will generate the symme- 
tries. It is an element of the Lie algebra, see below. 

In robot vision a common problem is to reconstruct the motion of 
objects given two or more successive images of the scene. Flow fields 
are constructed from differences in image intensity. However, it is well 
known that certain motions of objects cannot be detected in this way, 
Horn gives the example of a sphere rotating about any diameter, [5] . 

It is clear that motions of all Reuleaux pairs are undetectable in this 
way. It is also clear that these will be the only undetectable rigid body 
motions if we discount discrete symmetries of regular polyhedra exactly 
synchronised with the succession of images. For example, we would not 
expect to see the motion of a tetrahedron if it executes a symmetry 
between the images we capture. On the other hand we are not really 
limited to rigid motions here. There may be other motions which are 
invisible to such a system, but this would probably depend very much 
on the lighting conditions. For example, consider the motion of a sphere 
again, this time the sphere is moving away from the observer and at the 
same time is expanding at such a rate that the apparent contour in the 
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image remains constant. This would be detectable under point lighting 
but there might be lighting conditions under which a move like this is 
undetectable. 



6. Forward Kinematics 

From the above it is clear that motion about a 1-degree-of-freedom 
joint can be parameterised by a one-parameter subgroups. 

The general case is a helical joint where the corresponding 1-parameter 
subgroup can be written 

A{s) = (^ 



where x is the direction of the joint axis, p is the pitch of the joint and 
u the position vector of a point on the joint axis. Revolute joints are 
most common in practical machines, these correspond to subgroups of 
pitch zero. 



A{9) = 



R {I-R)u 
0 1 



Sometimes prismatic joints are used, the corresponding 1-parameter sub- 
group for such a joint has the form. 




where t is the direction of the joint. Notice in each of the cases above, 
the identity element is given when the parameter is zero. 

For a serial robot, such as the on illustrated in figure 1, it is important 
to find the transformation undergone by the end-effector. This problem 
is usually known as the forward kinematics of the robot. Let us fix a 
standard or home position for the robot. In the home configuration all 
the joint variables will be taken to be zero. Now, to move the robot to a 
position specified by a set of joint variables 6 ^ = (6*i, 62 -, O 3 , 64 , 65 , 9q) 
we can perform the motions about the joints in turn beginning with the 
distal joint 9q; the one furthest from the base of the robot. The motion 
of the end-effector about this joint will be given by the matrix Aq(9q). 
Next we move the fifth joint, the axis of this joint has not be affected 
by the motion about the last joint and so the effect on the tool is just 
^ 5 (^ 5 )- In this way we can work our way down the arm and deduce the 
overall transformation on the end-effector or tool to be. 



K{6) = A,{9i)A2{92)As{93)A4{94)A5{95)Ae{9e) 
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This expression represents the active transformation undergone by the 
robot’s end-effector relative to the home configuration. It is straightfor- 
ward to compute the ‘A’ matrices if we know the position of the robot’s 
joint axes in the home configuration. 

In the robotics literature it is also common to use tool-frame coordi- 
nates, and to give the forward kinematics in terms of this frame. Specif- 
ically, the tool- frame is a coordinate system fixed to the robot’s end- 
effector. We seek the passive, coordinate transformation which converts 
coordinates in a frame fixed in the base link of the robot to coordinates 
in the tool frame. Suppose that in the robot’s home position the active 
transformation from the base link frame to the tool frame is given by the 
matrix B. The overall transformation between the base link frame and 
the tool frame in an arbitrary configuration of the robot will be given by 
the transformation, B to the tool frame in the home position, followed 
by the transformation of the end-effector itself. That is K{6)B. This is 
still an active transformation, to convert it to a coordinate transforma- 
tion, as we saw above in section 1, we simply invert the matrix. Hence, 
the kinematics in terms of tool-frame coordinates is given by, 

7 . Lie Algebra 

In 1900 Ball published his influential treatise on “the theory of screws” 
[1]. With the benefit of hindsight it is clear that Ball’s screws were 
simply elements of the Lie algebra of SE(3). More precisely, the twists 
correspond to the element of the Lie algebra, the screws are elements of 
the projective space formed from the Lie algebra. This distinctions will 
not be used in the following. 

The theory of Lie groups and their Lie algebras was developed some 
years later by. Lie, Klein, Killing and Cartan among others. So now we 
are in a position to take advantage of the Lie theoretic viewpoint and 
update Ball’s original screw theory. 

The Lie algebra of a Lie group can be defined in several different ways. 
Historically, Lie algebra elements were thought of as infinitesimal group 
elements. Here we can think of the Lie algebra of a group as the tangent 
space at the identity element. To find the Lie algebra elements we take 
a curve in the group and find the derivative at the identity. 

We can use use the 4x4 representation of SE{3) to find its Lie algebra. 
As we saw above finite screw motion can be written, 

A{e) = 
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where, i? is a rotation through angle 6 about the axis x, and u is a point 
on the axis. This defines a sequence of rigid transformations parame- 
terised by 9. Moreover, when 0 = 0 we get the identity transformation. 

So taking the derivative at 0 = 0 gives a typical element of the Lie 
algebra, 

O - nu\ 

0 0 ) 

Here H = dR(0)/d9 is a 3 x 3 anti-symmetric matrix. This can be seen 
by differentiating the relation that ensures R is orthogonal 




de 



RR^ = 



dR 



dR^ 



d9^ ^^'d9 




when 0 = 0 we have that R = I and hence. 



dR{0) dR{0)'^ 

de ^ de 



Now a 3x3 anti-symmetric matrix corresponds to a 3-dimensional vector. 



/ 0 -uiz 

Q = I UJz 0 
\ 




where coi are the components of the vector, lj. With these definitions the 
product of the matrix H with any 3-dimensional vector exactly models 
the vector product with u>, that is, Hp = cj x p for any p. 

Hence, we can see that u: is the angular velocity vector of the body 
and uo is its magnitude. 

A general element of the Lie algebra se(3), is a matrix with the par- 
titioned form. 



S 





These matrices form a 6-dimensional vector space and it is often useful 
to write the elements of the space as six-dimensional vectors. 



s = 




8. The Adjoint Representation 

So far the group SE(3) has been defined by it 4 x 4 matrix repre- 
sentation. Sometimes it is useful to think of the group as an abstract 
object and consider different representations of it. A representation here 
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is a linear representation, that is, we represent group elements by ma- 
trices, or more genrerally linear transformations. The group product is 
modeled by matrix multiplication and inverses in the group by matrix 
inversion. For any group there are many such representations, here we 
look at a representation which can be defined for any group. Any Lie 
group acts linearly on its Lie algebra. This representation is known as 
the adjoint representation of the group. 

For SE(3) we can write this in terms of the 4x4 matrices as 

S' = ASA-^ 

where A = A(g) is the 4x4 matrix corresponding to some group element 
g. In terms of the 6-dimensional vectors representing the Lie algebra, 
we get, 

s' = Ad{g)s 

Here Ad( 5 f) is a 6 x 6 matrix representing the group element g. The 
general form of the matrices in this adjoint representation is. 




again i? is a 3 x 3 rotation matrix but T is a 3 x 3 anti-symmetric matrix 
corresponding to a translation vector t. 

This adjoint action of the group on its Lie algebra can be extended to 
an action of the Lie algebra on itself. In terms of the 4x4 representation 
of SE(3) this can be explained as follows. Take S 2 to be an element 
of the Lie algebra, as a 4 x 4 matrix. Now suppose A{6) is a one- 
parameter subgroups corresponding to a Lie algebra element S\^ that is 
dA{d)/d9 = S\. Now if we differentiate the adjoint action of the group, 
AS 2 A~^ and set 0 = 0 we get the commutator, 

51^2 - 525i = [5i, 52] 

In terms of 6-dimensional vectors the commutator can be written as 

ad(s.)s,= (^; !7")(vj) 

The notation ad, lower case, denotes the adjoint action of the Lie 
algebra on itself. The commutators of elements are a key feature of 
Lie algebras, in robotics these commutators turn up in the dynamics of 
robots and in many other applications. Notice that in the 6-dimensional, 
partitioned form of representation of the Lie algebra, commutators can 
be written, 

[si, S2] = ad(si)s2 = ( X c^2 \ 

^ ^ ' \^vi X a;2 -h X V2y 
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9. The Exponential Map 

The exponential map sends Lie algebra elements to the Lie group. In 
a region of the origin the map is guaranteed to be a homeomorphism, 
and hence we can use the Lie algebra as a coordinate system for the 
group but only in a patch containing the identity. For SE{3), this patch 
is quite big, we just have to keep rotation angles to between ivr. The 
exponential map provides the link between the Lie algebra and the group. 

In general, the exponential of a matrix X is defined by the series, 

= I + X + ^X'^ + ^X^ + • • • 

2! 3! 

where X'^ is the matrix product X'^ = XX etc. It is possible to show 
that the series converges and, if X is a matrix representing an element 
of a Lie algebra, then the exponential will be a matrix representing 
an element of the corresponding Lie group. The mapping is essentially 
independent of the representation. 

The product of a pair of matrix exponentials is generally not simply 
the exponential of the sum of the exponents. This is because the expo- 
nents may not commute. However, if 5 is a fixed screw, then it is clear 
that 

e0iSgd2S ^ g(ei+02)5 

So these are the 1-parameter subgroups again, and we can write, 

A{9) = 

This means that we can associate a screw S, to any 1-degree-of-freedom 
mechanical joint. We will see how to do this in detail in a moment. For 
now simply observe that the forward kinematics for a six-joint industrial 
robot, as outlined in section 6 above, can be written as 

K{6) = e®l'5le®2S'2g03<S'3g6»454g05>S'5ge6S'6 



The parameters 0*, are usually referred to as the joint parameters, or 
joint angles if the joints are revolute. This form of forward kinematics 
of serial robots was introduced by Brockett [2] . 

As promised, we next look at how to associate a screw to a mechanical 
joint. To each joint we can associate a 1-parameter subgroup. 






_(R |fx+(/-i?)u 



0 



where the screw S has the form, 

5 = 



H V 
0 0 
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If we look at the rotation part first we expect that x to be along the axis 
of rotation so that Rx = x. Differentiating this with respect to 9 and 
setting 0 = 0 we get, 

Dx = a; X X = 0 



From this we can conclude that u is proportional to x. In fact, it makes 
sense to define a; = x so that the parameter 9, becomes the rotation 
angle. Next, we look at u, a point on the axis of the screw motion. Here 
we expect. 



m 



I Op 
1 



that is, a point on the axis of the screw is simply translated alog the 
axis. The derivative at 0 = 0 gives 



5 



— I 2n 

0 



X 



That is. 



o _L 

im + V = — X 
27T 



Replacing x by a; and rearranging we get the result 

P 

V = u X a; H uj 

2tt 

Notice that if the pitch of the joint vanishes p = 0, that is the joint is 
revolute, then the screw we associate with it. 



“ j 

U X u J 

is simply given by the Pliicker coordinates for the line forming the axis 
of the joint. For helical joints we add the extra term to the v part 
of the screw, where p is the pitch of the joint. The above does not apply 
to prismatic joints, but they are easy to deal with. Using the same sort 
of argument as above, a prismatic joint which allows movement in the 
direction v will be represented by a screw. 




In general it is quite difficult to compute matrix exponentials, but in 
the case of rigid body motions we are lucky, the problem is relatively 
straightforward. 
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Consider a typical element from the 4x4 representation, 

/O V 

VO 0 

Any such matrix satisfies the polynomial equation, 

+ = 0 

this can be checked by direct computation. The equation factorises as 

5^ + \uj\^S^ = S^{S + i\uj\I){S - i\uj\I) 

Now we can find three matrix functions of S, labelled Pq, and P_, 
which satisfy = Pi and PiPj = 0 if i / j, such a system is called a 
system of mutually annihilating idempotents. These matrices are given 
by, 



Pn = 



P+ = 



P_ = 



1 



UJ 



-1 



iS^ + \u;\^I), 
rS^{S -i\uj\I), 



■S‘^{S + i\uj\I). 



here i = y/—l is the imaginary unit. We also require a nilpotent here, 
Nq = This nilpotent satisfies Nq = 0, NqP^ = 

NqP- = 0 and NqPq = Nq. 

The point of all this is that we can write our original matrix S as 
S = Nf) — i\iv\P^ + i|a;|P_ 

and so when we take powers of S we only need to take powers of the 
coefficients of P+ and P- since there are no cross-terms and the powers 
of the matrices are simple. So exponentiating we get. 



= I + No + - 1)P+ + (e*PI - 1)P_ 

It is also straightforward to see that I = Pq + P+ -|- P- so the exponential 
can be simplified to, 

= Pq + No + e"*l‘^lp+ + 

Finally, substituting back the Ss and replacing the complex exponentials 
with trigonometric functions we have the result 






= I + S + 



-(1 — cos |o;|)5^ -|- 1 — — sin |u;|)5^ 
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The corresponding formula for the group SO{3) is known as the Ro- 
drigues formula. 

We can produce a similar result for the adjoint representation of 
SE(3). A typical element of the Lie algebra in this representation is 
given by, 

ad(s) = 

In general we would expect a 6 x 6 matrix to satisfy a degree 6 polynomial 
equation, these matrices however, satisfy a degree 5 equation, 

ad(s)^ -|- 2|o;|^ ad(s)^ -|- ad(s) = 0 

This relation can be verified easily using the following relation between 
pairs of 3 X 3 anti-symmetric matrices, 

-h -h vn) = -|w| V 

In turn this relation can be verified by using the connection between 
these matrices and vector products, Vx = v x x for any vector x, and 
then using the familiar formula for vector triple products. 

The degree 5 equation above factorises as 

ad(s)^ -|- 2|(jp ad(s)^ -|- ad(s) = ad(s)(ad(s) -|- i|ti;|/)^(ad(s) — i\co\I)'^ 

So this time we have two repeated factors and hence we seek three idem- 
potents as before and two nilpotents, one associated with each repeated 
factor. 

^0 = 



P+ = 



N+ = 



P_ = 



N_ = 



■j-^(ad(s) -|- i|(j|/)^(ad(s) — 

(ad(s)"^ -|- 2|(jp ad(s)^ -|- |(j|"^/)/|w|^, 

— 1 ^ ad(s)(ad(s) — l|w|/)^(2 ad(s) -|- 3i\uj\I) 

— (2 ad(s)^ — ad(s)^ -|- 4|wp ad(s)^ — 31|wp ad(s))/4|(u|"^, 

ad(s)(ad(s) -|- l|w|/)(ad(s) — i\oj\I)‘^ 

(ad(s)^ — ad(s)^ -|- ad(s)^ — i\ojf' ad(s))/4|(u|^, 
ad(s)(ad(s) -|- i\oj\I)‘^{2 ad(s) — 3i\co\I) 

— (2 ad(s)^ -|- i\uj\ ad(s)^ -|- 4|w|^ ad(s)^ -|- 31|w|^ ad(s))/4|(j|"^, 

ad(s)(ad(s) -|- l|w|/)^(ad(s) — i\oj\I) 

(ad(s)"^ -|- i\uj\ ad(s)^ -|- IcuP ad(s)^ -|- ad(s)) 
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ad(s) = —i\io\P+ — ilwlAi.!. + i\uj\P- + i\uj\N- 
and as usual we also have, 



I = Po + P+ + P. 

Notice that with matrices P and N such that P^ = P, = 0 and 
PN = N the /cth power of their sum is simply, {P + N)^ = P + kN. 
Hence the exponential of a matrix ad(s) from the adjoint representation 
of se(3) can be written as 

gad(s) _ i)p_ + 



or 



gad(s) ^p^p + ilcule^l^'liV. 



Expanding the idempotents and nilpotents in terms of powers of ad(s) 
we finally obtain the result 



gad(s) = I p 

+ 

+ 

+ 



1 



2|td| 

1 

2|td|2 

1 

2 ^ 

1 



2|a;|4 



(3 sin |o;| — |u;| cos |o;|) ad(s) 

(4 — 4 cos |cd| — |o;| sin |o;|) ad(s)^ 
(sin |(x^| — |u;| cos |td|) ad(s)^ 

(2 — 2 cos |u;| — |cd| sin |u;|) ad(s)^ 



10. Derivatives of Exponentials 

Suppose we are given a path in a Lie group {SE{3) here) as an expo- 
nential, 

ff(t) = e^W. 

How can we find the derivative of this formula and what does it repre- 
sent? 

In the previous section we met the simple case where the path in 
the Lie algebra X(t) is just tXo where Xq is a constant element in the 
algebra. The derivative of such a path is simply, 

dt 
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However, when X(t) is more complicated this is no longer valid. The 
problem is that dX/dt = X does not necessarily commute with X so 
differentiating the expansion of the exponential gives 

—e^ = X + ^(XX + XX) + ^(XX'^ + XXX + X^X) + --- 

dt 2! 3! 

• • • + —^—{XX^ + XXX^-^ + • • • + X^X) + ■■■ 

[k + Ij! 

In [3] Hausdorff showed that 

=X + ^[X, X] + [X, X]] + ^[X, [X, [X, X]]] + • • • 

The right-hand side of this equation will be abbreviated to X^- Notice 
that this matrix is composed of sums of iterated commutators and hence 
Xd is an element of the Lie algebra. The Hausdorff formula above implies 
that 

dt 

In the rotation group SO{3) this means that X^ corresponds to the 
angular velocity of the motion. In SE(3), the group of rigid body mo- 
tions the corresponding vector is the velocity screw of the motion. The 
equation is important in several applications. In a typical dynamical 
simulation, for example, equations of motion would be solved numeri- 
cally to find the velocity screw or, angular and linear velocities. From 
these quantities the transformation matrices usually have to be found. 
It is essentially the above equation that has to be solved to find the 
rigid transformation undergone by the rigid body under consideration. 
In most commercial applications a rudimentary approach to this prob- 
lem is often taken. This is because the speed of computation is far more 
important than the accuracy. 

Using vectors to represent the Lie algebra the Hausdorff formula can 
be written neatly as 



Xd 



E 

k=0 



{k + l)\ 



ad'=(x)i 



with ad() the adjoint representation of the Lie algebra, as usual. 
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For SE{3) we can evaluate this infinite sum using the idempotents 
and nilpotents found in section 9 above. This gives 



E 

k=0 



1 



(fc + 1): 






(fc + 1)! 



k=0 

CO / -I \\k 






k=0 



k=0 



(fc + 1)! 

{k + l)l - 



k=0 

CO 7 / . I I /j. 



Evaluating the infinite sums gives 



E77T^»<i(8)‘ = P„ + ^(e-H-l)P+ 



fc =0 



(A: + l) 



+ 



-l\UJ\ 

1 



{{-i\uj\ - l)e-*l‘^l +1)A^+ 



1 



+ - 1)^’- 
i\(jj\ 

1 



- l)e*l‘^l 

Substituting for the idemponents and nilpotents gives 



+ — ((i|o;| - l)e*l^l + l)iV_ 



E 

k=0 



ad(s)^ = / + ^ (4 — |u;| sin |u;| — 4 cos |a;|) ad(s) 



(A: + l)! 



+ 

+ 

+ 



2 | u ;|2 

1 

2^ 

1 

2|u;|4 

1 



(4|a;| — 5 sin |(x^| + |u;| cos |t(;|) ad(s)^ 
(2 — |u;| sin |u;| — 2 cos |a;|) ad(s)^ 
{2\oj\ — 3sin |u;| + |a;| cos Iwl) ad(s)^ 



This format is not particularly useful for dynamic simulation, it would 
be more useful to have s as the subject of the equation. This would allow 
direct numerical methods to be used. Such an inversion is possible in 
general, see [3]. The result is another infinite series, however it is not 
necessary to follow this route since the above formula can be inverted 
more directly using the idempotents. Recall that Pq + + P_ = I so 

that for any constants a*, bi, 



{qqPq + a_)-R-|_ + 6-|_Ab|_ + tt—P— + b—N—) 



* ( — Pq H P+ 

ao a+ 



%iV+ + — P_ - ^N_) 
ai a_ at 



= {Po + P+ + P-) = I 
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The computations are a little more than can be comfortably done by 
hand, but are readily computed using a computer algebra package such 
as Maple or Mathematica. 

Let, Sd be the Lie algebra element satisfying = ad(s£i)e'^‘^*^®\ 

then we have the result 



s 




\ ad(s) + 



|ti;| + 3sin |w| 



+ 



wP 4|(j|(cos |w| — 1) 
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Notice the absence of a term in ad(s)^ in the above. 

Finally here, we look at the interpretation of these derivatives. Con- 
sider a general sequence of rigid motions parameterised by time t, then 
we can see from the above that 



dt 



Sd 



is an element of the Lie algebra and that it corresponds to the instan- 
taneous velocity. The 17 or a; part is the angular velocity while the v 
part is the linear velocity of a point on the screw axis. We can find the 
velocity of a point on the body moving with this motion as follows. At 
time t = 0 the point will be assumed to have position vector po, at any 
subsequent time its position will be given by. 




Differentiating with respect to time gives 




Hence the velocity of the point is. 



p = a; X p -|- V 

The screw Sd or will be called the velocity screw of the motion. 



11. Jacobians 

The exponential form of robot kinematics is useful for deriving differ- 
ential properties, for example Jacobians. 

Consider a point attached to the end-effector of the robot, in the 
robot’s home position the point has position vector p. Hence, at any 
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subsequent configuration of the robot the position of the point will be 
given by, 

If the joints are moving with rates, 9i, 62 , ■ ■ - Oq then the velocity of the 
point will be. 




■ dK ■ dK ■ dK 

C/6'1 C/6'2 C/6'6 




The partial derivatives are easy to evaluate at the home position. 



dK 



g^l'S'l . . . 



= Si 



So in the home position the velocity of the point is. 




{9181+6282 + ■■■ + 6 ^ 3 ^) 




This is more general than it looks. If we choose the current configuration 
of the robot to be the home configuration then the above argument will 
apply. So, Si here will be taken to mean the current position of the 
f-th joint. Alternatively, we could let s? denote the home position of 
the robot’s joint screws. Subsequent configurations for the joints will be 
given by, 

Sj = ad(s°) _ _ _ g6i_i ad(sO_j)gO 



Hence, we have that 



o9i 



So we can find the velocity screw of the robot’s end-effector by simply 
substituting the adjoint representation of the Lie algebra. Hence, 




Si^i -|- 82^2 + • • • + S6^6 



If we compare this with the Jacobian relation for the velocity = JO, 
we can see that the columns of the 6x6 Jacobian matrix must be the 
current joint screws of the robot. 



J = Si 



S2 



S 3 



84 



S 5 



S6 



A well known result. 
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An immediate application of the above is to robot control. Suppose we 
wanted the robots end-effector to follow a path given by a exponential, 
gs(t) _ ^ traditional approach to this problem, the rigid motions along 

the path would be computed and then a inverse kinematic routine would 
be used to find the corresponding joint angles. It is probably more 
efficient to find the joint angles by numerically integrating the equation, 

J6 = Sd 

For most current designs of industrial the Jacobian matrix J can be in- 
verted symbolically, although this doesn’t seem to be widely appreciated. 
Also Srf can be computed using the formulas given above. 

12. Concluding Remarks 

In this brief lecture there has only been enough time to give an outline 
of how group theory can be used in robotics. Several important appli- 
cations have been omitted in the interests of brevity. Although we have 
discussed the forward kinematics of serial manipulators the inverse kine- 
matics is an key problem in the subject. Here we know where we want 
to place the robot’s end-effector and we must compute the joint angles 
needed to achieve this. Although, an approach using differential equa- 
tions has been sketched above, this clearly runs into difficulties when 
the robot’s Jacobian becomes singular. The investigation of these robot 
singularities is another problem where group theory has been usefully 
applied, see [6]. For parallel mechanism, such as the Gough-Stewart 
platform, the inverse kinematics are straightforward, it is the forward 
kinematics which are hard. 

Robot dynamics requires another couple of representation of the group 
SE{3). First, the forces and torques acting on a rigid body can be com- 
bined into a single 6-dimensional vector called a wrench. The wrenches 
transform according to a representation dual to the adjoint represen- 
tation, the coadjoint representation. An accidental property of SE{3) 
means that the adjoint and coadjoint representations are similar. In 
Ball’s original screw theory velocities and wrenches were treated as the 
same sort of object, however, to make progress it is simplest to separate 
them. The second representation we need is the symmetric product of 
the adjoint representation. Elements of this representation can then be 
interpreted as 6 x 6 inertia matrices. 

There are many applications of group theory to robot vision. Now 
the group may be different. For example, there are formulations of the 
camera calibration problem that use the group 5'L(4) to model the possi- 
ble camera parameters. Certainly projective geometry will be important 
when discussing cameras and there are projective groups associated with 
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this geometry. The group relevant to geometric optics is the symplectic 
group 5p(6,M), this is the symmetry groups of a complex of lines in 
space. 

Finally, the above presentation has used matrices and vectors exclu- 
sively. There is some merit to using Clifford algebras to represent the 
group of rigid body motions and the various spaces on which it acts. 
Relations and formulas in Clifford algebra tend to be neater than their 
matrix counterparts and this is useful for symbolic and hand computa- 
tion. Another advantage of the Clifford algebra is that the group SE{3) 
can be represented very compactly and this allows us to study the geom- 
etry of the group itself. See Joan Lasenby’s lecture later in this meeting. 
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Abstract Classical relativistic physics in Clifford’s geometric algebra has a spino- 
rial formulation that is closely related to the standard quantum formal- 
ism. The algebraic use of spinors and projectors, together with the bilin- 
ear relations of spinors to observed currents, gives quantum-mechanical 
form to many classical results, and the clear geometric content of the al- 
gebra makes it an illuminating probe of the quantum/classical interface. 
The aim of this lecture is to close the conceptual gap between quantum 
and classical phenomena while highlighting their essential differences. 
The paravector representation of spacetime in APS is used in partic- 
ular to provide insight into what many pioneering quantum physicists 
have considered classically indescribable: spin- 1/2 systems and their 
measurement. 



Keywords: algebra of physical space, Clifford algebra, entanglement, paravectors, 
quantum/classical interface, quantum computing, quantum informa- 
tion, relativity 

I. Introduction 

This lecture continues the theme[l] that Clifford’s geometric alge- 
bras, in particular the algebra of physical space (APS), provide the lu- 
bricant for smooth paradigm shifts from Newtonian to post-Newtonian 
paradigms. The focus here is on the shift to the quantum view of the 
universe, and how APS can make the quantum/classical interface more 
transparent. 

After a brief review of paravector space and its ability to model space- 
time, we start the lecture by relating the classical eigenspinor of a parti- 
cle, that is the Lorentz rotor that takes it from rest to its motion in the 
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lab, to its relativistic wave function. We then show how the 2 -valued 
measurement of the spin arises naturally from the algebraic properties 
of rotors, and how the g factor and magnetic moment of elementary 
fermions result. The spin itself arises classically as a gauge freedom of 
the eigenspinor, and as we will see, the choice of a spin rate propor- 
tional to the total energy of the system leads to de Broglie waves and 
momentum eigenstates of the Dirac equation. Finally, we consider work 
on entangled spin states which are of interest in quantum computation. 

2. Paravector Space as Spacetime 

Recall from my previous lecture[l] that we use paravector space in 
APS to model spacetime. In addition to the three dimensions of physical 
space, paravector space also includes scalars. A typical paravector can 
be written 

p = pOj^p= p^^e^ ^ ( 1 ) 

where is a scalar and p a physical vector, {ei, e2, 63} is an orthonormal 
basis of physical space, and eg = 1 . The paravector basis {00,61,62,63} 
is the observer’s proper basis (at rest). Paravector space in APS is 
thus a four-dimensional linear space. It has a metric determined by the 
quadratic form, which for the paravector p is proportional to the scalar 
pp = pp = (p^) — p^, where p = p^ — p is the Clifford conjugate of 
p. Since the quadratic form is a scalar expression, insertion of the basis 
expansion (I) gives 



pp = {pp)g = p^^p'^ {e^e^)g = p^p'^V/My, (2) 

and the metric tensor = {e^eiy)^ automatically arising here is the 
Minkowski metric of spacetime. 

Rotations in paravector space preserve the “square length” pp of par- 
avectors and therefore also the scalar products (pq) 5 = 5 {pq + qp) be- 
tween paravectors. They have the form often called a spin transforma- 
tion[2] 

p^LpL\ ( 3 ) 

where L is a unimodular element (LL = I) known as a Lorentz rotor. 
Lorentz rotations are the physical Lorentz transformations of relativity: 
boosts, spatial rotations, and their products. Several Lorentz rotations 
can be applied in succession, for example p L^L 2 L\pL\l\l\. All the 
information about the transformation is contained in the composition 
L = L3L2L1 , which of course is another Lorentz rotor. Indeed, the 
Lorentz rotors form a group isomorphic to SL ( 2 , C) , the double covering 
group of SO+ ( 1 , 3 ) . 
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Lorentz rotors L are generated by biparavectors: 

L = ± exp (W /2) G $pm+ (1, 3) ~ SL (2, C) , 
where W is a biparavector 

If W is a vector (a timelike biparavector), then L = B = is a boost, 
whereas if W is a bivector (a spacelike biparavector) , then L = R = 
is a spatial rotation. Every rotor L can be uniquely factored into the 
product of a boost and a rotation [3]: 

L = BR . 

The spin transformations of paravector products are found by trans- 
forming each paravector factor (3). 

3. Eigenspinors 

Recall[l, 4] that the eigenspinor A of a system is an amplitude for 
the motion and orientation of the system. More precisely, it is a Lorentz 
rotor of particular interest, relating the system reference frame to the 
observer (or lab). It transforms distinctly from paravectors and their 
products: a Lorentz transformation L applied in the lab transforms 
A ^ LA. This characterizes A as a spinor. The eigen part of its name 
refers to its association with the system, which in the simplest case is a 
single particle. The eigenspinor is generally a reducible element of the 
spinor carrier space of Lorentz rotations. Any property of the system in 
the reference frame is easily transformed by A to the lab. For example, 
in units with c = 1 the proper velocity of a massive particle is taken to 
be ii = eo = 1 in the reference frame; that is, the reference frame at 
proper time r is an inertial frame that moves with the particle at that 
instant. In the lab, the proper velocity is 

u = AoqA'^ = AA'^, (4) 

which is also the timelike basis paravector of a frame moving with proper 
velocity u . The complete paravector basis of the moving frame (relative 
to the observer) is { = Ae^ A^” } with m = uq . Thought of as a passive 
transformation, A transforms the observer from the reference frame to 
the lab. 

3.1 Time evolution 

The eigenspinor A of an accelerating or rotating system changes in 
time, and we let A (r) be the Lorentz rotation at proper time r from the 
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reference frame to the commoving inertial frame of the system. Eigen- 
spinors at different times are related by 

A(t2) = L(t2,ti) A(n) , 

where by the group property of rotations, the time-evolution operator 

L{t2,ti) = A(t 2) A(n) 

is another Lorentz rotation. The time evolution of A is found by solving 
an equation of motion 

A = ^CtA = ^An,ef , (5) 

where the unimodularity of A (AA = 1) implies that Ct = 2AA = — 2AA 
is a biparavector that can be identified as the instantaneous spacetime 
rotation rate of the system as seen in the lab, and firef = AQ,A is its 
value in the reference frame. Note generally that operators from the left 
on A act in the lab whereas those from the right act in the reference 
frame. 

The equation of motion (5) allows us to compute time-rates of change 
of any covariant property known in the reference frame in terms of Ct 
or riref- For example, the acceleration in the lab can be expanded ii = 
d (AA^) jdr = AA^ -|- AA^^ = 2 ^AA^^^ and substitution of the equation 
of motion (5) gives 

ii = (nu)jj , (6) 

Here, ~ h indicates the real (grades 0-1-1) part of x. In 

terms of firef ; 

U = ^AJlrefAi^^ = A (riref)^? A^. (7) 

Note that relation (7) has the form of a Lorentz rotation of a paravector 
from the reference frame to the lab. Evidently the real (vector) part of 
the rotation rate Liref in the reference frame is just the acceleration of 
the system there. Equation (6) is just the covariant Lorentz-force equa- 
tion if we identify Ct = eFjm. This identification provides a convenient 
covariant operational definition of the electromagnetic field F. However, 
this definition is more restrictive than is required by the Lorentz-force 
equation, and as we see below, a generalization is needed in the presence 
of spin. 

The equation of motion (5) for eigenspinors, especially when supple- 
mented by algebraic projectors, offers powerful new tools for finding 
classical trajectories of charges in electromagnetic fields. Advantages of 
the eigenspinor approach are nicely demonstrated in new analytic solu- 
tions for the autoresonant laser accelerator. [5] 




Quantum/ Classical Interface 



131 



3.2 Gauge Transformations 

The eigenspinor equation (5) embodies more information than can 
ever be obtained from the Lorentz force equation alone: A gives not 
only the velocity AAl of the particle, but also its orientation. In this 
section we look at the gauge freedom associated with orientation and see 
that it leads to the concept of spin. 

The proper velocity u = AAl of a massive particle is invariant under 
transformations of the form 



A ^ KRq , (8) 

where Rq is any spatial rotation: i^o-^o ~ -^ 0-^0 = 1- Because Rn is posi- 
tioned on the right of A, it acts on the reference frame. Transformations 
that leave measured values unchanged are called gauge transformations, 
and if, as in Newtonian mechanics, we measure only the spacetime posi- 
tion (world line) of the particle and derivatives thereof, then the reorien- 
tation (8) of the reference frame is a gauge transformation. We therefore 
have the gauge freedom to choose the reference-frame orientation as we 
wish. 

The rotation Rq in (8) may be time dependent. Then, in order for 
the equation of motion (5) to remain valid, not only is the spacetime 
rotation rate in the reference frame transformed to the new reference 
frame, but it also adds an imaginary (bivector) part: 

^ref ^ Rq (^ref '^^o) ^0 ; (9) 

where — icjQ = 2RoRq is the spatial rotation rate that generates Rq. 
We can confirm from Eq. (7) that the two transformations (8) and (9) 
together leave the acceleration (and by integration, the proper velocity 
and the world line) of the massive particle unchanged. 

3.3 Free De Broglie Waves 

The gauge freedom of the classical eigenspinor equations thus admits 
an arbitrary rotation of the reference frame. It is reasonable to assume 
that for a free particle the rotation rate cjq is constant, and since we 
are free to choose the initial orientation of the reference frame, we can 
take uq = Woes. This rotation rate generates what we refer to as a 
“spin” rotation, with the spin rotor Rq = exp (— ie3Wor/2) acting in the 
reference frame. The eigenspinor of the free particle then has the form 



A(r) 



A(0) e" 



-ie3U!0T/2 



(10) 
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and satisfies the equation of motion 

A = -Arij-ef , f^ref = 

The linear form of the equation of motion (5) generally suggests the 
use of superpositions of solutions with different world lines, and hence 
with different proper times r. To be able to combine these we need to 
express the proper time r in particle coordinates x : 

r = {xu)s ={ux)s, (11) 

where u is the proper velocity of the particle. When r (11) is substi- 
tuted in (10), the result has the form of /ree de Broglie waves in covariant 
formulation, except that the phase is now a real angle of the spin rota- 
tion. The wavelength is fixed at the de Broglie value by relating the rest 
energy m of the particle to ujq by 

m = ]^huJo ■ 

In terms of the momentum paravector p = mu, the free eigenspinors 
( 10 ) are then 

A (x) = A (0) exp [-fes {px)g /h] . ( 12 ) 

4. Spin 

The experimentally verified existence of de Broglie waves suggests 
that the spin rotation in the free eigenspinor ( 12 ) is real and occurs at 
the high angular frequency associated with Zitterbewegung: 

u)Q = 2m!h = 1.55 X 10^^ s“^ . 

As introduced above, the proper paravector basis {e^} of the particle is 
transformed to the basis {w = Ae^Af} of a eommoving partiele frame 
(relative to the observer) by the Lorentz transformation A. In particular 
the moving time axis uq = n = AA^ is the proper velocity of that frame 
with respect to the observer. It is natural to ask whether any of the 
transformed spatial vectors, = Ae^Af, carries any special physical 
meaning. By taking 63 to be the rotation axis in the reference frame, 
the corresponding unit paravector U 3 is the only spatial axis of the corn- 
moving frame that is not spun into a blur in the lab. It is associated 
with the spin. We refer to the unit paravector U 3 = re as the (unit) spin 
paraveetor^ and note that ww = — 1 . 



^The physical spin requires an extra scalar factor of /i/2 to ensure the correct magnitude and 
units. 
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Since eg and are orthogonal, so are uq and U 3 and hence u and w : 

(u 3 Uo)s = {wu)s = (6360)5 = 0, 

and U 3 U 0 = wu is a biparavector (a spacetime plane). Its dual is another 
biparavector 

S = Aeie 2 A = —iAe^A (13) 

= —iwu , 

which in a frame commoving with the charge is the spin plane. The dual 
of the spin paravector w = iSu = is seen to be the triparavector Su. 
This identifies w as the Pauli- Lubanski spin paravector. 



4.1 Factoring A 

Before considering the equation of motion for the spin, recall that as 
with any Lorentz rotor, A can always be factored into the product of a 
real boost B = and a unitary spatial rotation R = : A = BR . 

The proper velocity of the particle is u = AA^^ = B^. It follows that the 
boost part of the eigenspinor is 

B = ^ l+“ = , “ + ^ (14) 

y^2m {m + E) 

where p = E p = mu . 

The second equality in (14) can be derived simply from the rela- 
tions = B, BB = 1 and B + B = {2B)g , since then B (^B + B^ = 
B {2B)g = u + 1, the scalar part of which determines {B)‘^ . However, 
there is a more general geometrical derivation which makes the result 
easy to comprehend and remember. Recall from my previous lecture 
that any simple Lorentz rotation transforms a paravector v as 

V ^ u = LvL^ = 

where is the part of v coplanar with the paravector rotation plane 
of L. If this plane contains v, = v and u = L'^v. Let w be a timelike 
paravector of unit “length” : uh = 1 . It is the time axis of a frame moving 
at proper velocity v (relative to the lab). Then Lv is the unit paravector 
in the plane of v and u, halfway between them; it is the result of rotating 
V with only half of the rotation parameter needed to get it to u. Thus, Lv 
should be parallel to v u. Since it should have the same unit “length” 
as u, it must be 



Lv 



V u 

{v u) {v u) 



V u 

V^2(I -h {uv)s) 
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and therefore 

L = 

-\/2 (1 + { uv ) g ) 

In the special case above (14), v = 1. 

4.2 Spin Precession and the gi-Factor 

The charge-to-mass ratio of a classical charged particle is independent 
of its magnetic moment. The former determines the cyclotron frequency 
of the charge in a magnetic field, and the latter determines its precession 
rate. The magnetic moment of a spinning charge is proportional to 
its angular momentum, and the proportionality constant is given by 
the fif-factor times the charge-to-mass ratio. Consequently, the ^-factor 
determines the ratio of the precession rate to cyclotron frequency of a 
spin- 1/2 fermion in a magnetic field. 

The eigenspinor of a particle describes both its translational and rota- 
tional motion. Compound systems with several particles or other inde- 
pendent parts may require several eigenspinors to describe their motion, 
but a classical “elementary” particle can be defined as one whose motion 
is fully described by a single eigenspinor. As we show below, the equa- 
tion of motion (5) for the eigenspinor remains linear for this eigenspinor 
only \i g = 2. 

The reference frame of a massive spinning particle at proper time r, 
related to the lab by the eigenspinor A (r) , is taken to be a commoving 
inertial frame of the particle. It is a rest frame of the particle in that 
the proper velocity of the particle in its reference frame is 1 . We assume 
that the eigenspinor A includes the spin rotor Rq = exp (— ie 3 (UoT/ 2 ). 
Because of its spin rotation, its reference frames at slightly different 
r are rotated by large angles with respect to one another. For many 
purposes, it is convenient to consider a nonspinning rest frame, which 
is intermediate between the reference and lab frames. Such a frame is 
defined to be related to the lab frame by a pure boost B. The eigenspinor 
A is the product BR of this boost with the spatial rotation R that 
rotates the reference frame into the nonspinning rest frame. Under most 
circumstances, R is dominated by the spin rotor Rq. 

The equation of motion (5) can now be written in a third way 

A = ^CtA = ^Afiref = \BCl,estR, (15) 

where the spacetime rotation rate forest in the nonspinning rest frame is 
related to that in the lab (Ct) and in the reference frame (Href) by 



^rest — BCtB — RCty-gfR . 



(16) 
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The resulting proper acceleration (7) 

ii = (rirt)sjj = A (riref)j{ = B (rii.est)j{ B (17) 

is consistent with the Lorentz-force equation if and only if 

(^rest)j{ — 6 (Fi-est)j{ (1^) 

where Erest = (Frest)^} is the electric field in the nonspinning rest frame. 
The minimal covariant extension of (18) is = eF/m, but this is not 
acceptable in the presence of spin since the spin rotation rate remains 
even when the electromagnetic field F vanishes. Indeed, according to (9), 
because of the spin term, forest contains an extra bivector (imaginary) 
term —iujQRe^E) = — iwos, where 

s = Re 2 ,R) (19) 

is the unit spin paravector in the nonspinning rest frame. We therefore 
need to consider other covariant extensions of the relation (18). 

The Lorentz-force equation is independent of the magnetic field in 
the rest frame of the charge. We need another equation of motion to 
relate forest to the magnetic field. An obvious one is spin precession. 
By differentiating s (19) with respect to proper time, we obtain the 
precession equation 

S = (rjRS)jj = UrXS (20) 

with ri/j = 2RR = — = —wr . The spin paravector w in the lab is 
related to s by 

w = AesAl = BsB , 

and the equation of motion for w can be expressed using either the first 
or second equality: 

w = -^ (AgsAI^ = i^w)^ = B {Q,restS)^B 
= {Qbw}^ + BsB, 

where Lis = 2BB. By now, we have defined a bunch of spacetime rota- 
tion rates, but fortunately, they are simply related. Note in particular 

firest = BQ.B = 2BAAB = 2B (^BR -F BR^ R 

= 2 [bB + RR^ =ft^j^ + flR. ( 21 ) 

For simplicity, consider first the case that the electric field vanishes 
in the nonspinning rest frame. According to the Lorentz-force equation. 
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(17) with (18), in this case, the acceleration ii vanishes, and consequently 
so do B and Ct b ■ The precession rate is then proportional to Brest , and 
the ^-factor is defined so that it is — ^e<7Brest/?Ti- The full spacetime 
rotation rate forest has only a bivector part 

^rest — i'p. Brest iiOQS, 

2m 

where the first term on the right gives the precession rate of the spin 
and the second term generates the spin rotation as mentioned above. 

More generally, the presence of a rest-frame electric field adds an 
acceleration eErest/^w to forest : 

^rest — (Brest T ^TrBrest ) ICVqS . 

Since 





iB 



rest 




and Frest = BFB, transformation to the lab gives the rotation rate 

Ct = BCtj-estB = F -|- uF^'u + - fP— ttF^u) -|- loqS 
2m 1 2 V /J 

= - — {2 + g)F + {2 — g) uF^u -|- (JqS , (23) 

4m L J 

where from (13) and (19) S = —iBsB. The equation of motion for w is 

w = {nw)y^ = ^ {2 + g){Fw)y^ + {2- g)(^uF'<uw'j^ , 

which is exactly the algebraic form of the BMT equation. [6] The term 
involving S drops out since Sw is a pure bivector and thus imaginary. 
If we demand that the eigenspinor evolution (5) is linear in A, then Ft 
should not depend on the proper velocity u.[7] Then, from (23), we must 
have g = 2 and the spacetime rotation rate Ft reduces to eF/m + wqS . 

Note that the imaginary part of relation (21) relates the precession 
rate in the nonspinning rest frame to that in the lab. The difference 
between the lab and rest-frame precession frequencies is known as the 
(proper) Thomas precession and is given by 

U^Th ~ i i^R ^rest)cf — i {^b)q — (^Bb''J ^ . 

It is easy to evaluate from the expression (14) for B, since the only 
imaginary contribution BB arises from the term {uu)q / (2y -|- 2) : 

2i u X u 



(24) 
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4.3 Magnetic Moment 

The torque physically responsible for the precession of the spin in a 
magnetic field arises from the coupling of the magnetic dipole moment of 
the particle to the field. This coupling can be found independently from 
the spin rotation rate (22) in the nonspinning rest frame. If we assume 
a magnetic field i?rest ^ muoofe (~ 4.414 x 10® tesla when ujq = 2m//i), 
the rotation associated with the intrinsic spin dominates: 



^rest — iuiQS 1 



mujQ 



-sB 



rest 



with 



~ -iujQs' ( 1 S • Brest 

mtoo 



s' = RisR\ ~ S ( 1 ^ (sBrest)y ) , 

Rl = exp ( — ^ (sBrest)y 

\ ZmujQ 



(25) 



The shift in magnitude of the proper rotation frequency, together with 
the association Hujq = 2m of that frequency with the mass of the charge, 
implies a potential energy that shifts the mass. From (25), this potential 
energy can be expressed 

eh 

“X S • Brest — A*' ■ Tlrest (^6) 

2m 

with a magnetic dipole moment that for the electron with e < 0 is 

At = — s = -gtiQ- = -gos , (27 

2m 2 

where hs/2 is the spin vector, = |e| hj (2m) is the Bohr magneton, 
and = 2 is the g factor. 



5. Dirac Equation 

A simple classical equation of motion[7, 8] follows from the Lorentz 
transformation p = AmA^ and the unimodularity of A : 

pA^ = mA . (28) 

This is the classical Dirac equation, which has the form of the quantum 
Dirac equation in momentum space. It is a real linear equation, in that 
any real linear combination of solutions A is another solution. 
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Elementary particles are often modeled classically as point charges. 
An obvious advantage of point charges is that they are simple and struc- 
tureless. A disadvantage is that their electromagnetic energy is infinite, 
requires mass renormalization, and leads to preacceleration. An alterna- 
tive approach is to assume no a priori distribution. Instead, a current 
density j (x) is related by an eigenspinor field A (x) to the reference- 
frame density pref- 

j (x) = A (x) Pref (x) A^” (x) . (29) 

This form allows the velocity and orientation to be different at different 
spacetime positions x. The current density (29) can be written in terms 
of a density-normalized eigenspinor dt as 

where 'h obeys the same classical Dirac equation as A: 

^ |'3Q^ 

As with the eigenspinor equation of motion (5), this equation (30) is 
invariant under gauge rotations dt ^ 'hi?, real linear combinations of 
solutions are also solutions, and free particle solutions have the form of 
de Broglie waves 

'h (x) = 'h (0) exp [— ies {px)g /h] . (31) 

Recall that we used our gauge freedom to choose the spin axis to be 63 . 
This restricts further gauge rotations to also be about 63 : 

'h ^ 'h exp {—ie^fi) , (32) 

where the scalar parameter effectively sets the angular position from 
which rotation angles about 63 are measured. 

If dt is assumed to be a real linear superposition of classical de Broglie 
waves (31), the momentum p can be replaced by a differential operator: 

p^^ = ihd^^e^ = m'h. (33) 

The differential form of p has implications for the remaining gauge rota- 
tions. If the rotation angle 2(j) is fixed, the transformation (32) is said to 
be a global gauge transformation and there is no change in the differen- 
tial form (33) of the Dirac equation. Indeed, we can multiply from the 
right by any constant element that commutes with 63 . However, if is 
a function of position x, then (32) is a loeal gauge transformation and 
the Dirac equation (33) picks up an additional term: 

ihd^^e^ -|- h{d(j)) = m'k. 
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In order to have an equation of motion that is invariant under local gauge 
transformations, we play the usual game of introducing the paravector 
gauge potential A as part of the “covariant derivative” : 

= ihd^^es — = m'h, ( 34 ) 

and the gauge transformation ( 32 ) is accompanied by a gauge transfor- 
mation of A : 

eA ^ eA + h {dcf) . 

The result ( 34 ) is a covariant algebraic form of the usual quantum Dirac 
equation in APS. 

To cast the equation into a form that makes it easier to solve, we split 
it into two complementary minimal left ideals of APS with the projectors 
P±3 = 5 (1 ± 63) : 

p'P^P+3 = {ihd — eA) 'l'^P+3 = m'kP+3 
p'P'l’P_3 = {—ihd — eA) ^^P_3 = m^'P-s , 

where we noted the “pacwoman” property esPj-s = ±P±s . We can bar- 
dagger conjugate the second equation to project it into the same minimal 
left ideal of the algebra as the first: 

p4»P+3 = (ihd — eA) 'kP_|_3 = m^^P+3 . 

Then, defining 




we obtain a pair of projected equations in the matrix form 

This is fully equivalent to our single algebraic equation ( 34 ). The pro- 
jector P3 gobbles 63 and changes the spin rotor exp (— ia;oTe3/2) into a 
simple phase factor. If the standard matrix representation ^ is 
used for APS in terms of the Pauli spin matrices Eq. ( 35 ) is the tra- 
ditional form[ 9 ] of the Dirac equation with gamma 

matrices in the Weyl (or spinor) representation. The projection of the 
algebraic 'k by P±3 picks out the upper and lower components of 
and is seen to be equivalent to multiplication of by the traditional 
chirality projectors ^ (1 ± 75) with 75 = — i7o7i727s : 

2 (1 75) 77 'kP±3 • 
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The Dirac bispinor in the Dirac-Pauli (or standard) representation is 
related by 




where ^ ('h ± are the even and odd parts of 'h and corre- 

spond to the large and small components at low velocities: 

('!'>+ = pin : b >+ r = pin\I^^R == pHIr 

The last expression for (4')^ on the RHS is the low- velocity approx- 
imation. In the rest frame, the small component disappears and the 
eigenfunction is even. We say the particle has even intrinsic parity. 
More generally, the solutions (35) and (36) are 4x2 ma- 

trices whose second columns are zero and whose first columns give the 
usual Dirac bispinors of quantum theory. If R is replaced by the de 
Broglie spin rotor R = exp [— iea {px) g /h] , the solutions in the nonvan- 
ishing columns of and are the usual momentum eigenstates 

of the Dirac equation. 

5.1 Spin Distributions 

To study spin distributions, the low- velocity limit 

4/ cx (4/)^ cx plifR (37) 

is sufficient. In terms of Euler angles 4>, 6, y, about space-fixed axes, the 
rotor R can be expressed by 

R = exp (— ze3(/)/2) exp (— ie20/2) exp (— ie3x/2) . (38) 

The classical spin direction in a static system is s =Re^R^ [see (19) 
above], where R may be a function of position r. A distribution of such 
spin directions is thus prefResR^ where the positive scalar pref = /^ref (r) 
is the density of spins in the reference frame. As seen below, simple 
measurements of the spin direction give only one component at a time. 
The distribution of the component of the spin in the direction of an 
arbitrary unit vector m is 

Prei s ■ m = ( pj-eiResR^ m) . 
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In terms of the projector P+ 3 , since 63 = P +3 — P _3 and for any elements 
Pi Qi {pq) s ~ i^P) s ~ S ’ distribution is 



2 (^p,e{RP+3R^m^ ^ = 2 ^P+3i?t/,V2 ^^ 1/2 

= tr , (39) 



where by / we mean the ideal spinor, whose standard matrix represen- 
tation is 



i, = p,Srp+, = 



— p-*x/2„l/2 



Prei 



e-^PI^cos9l2 0 \ 

^uj>/2 gin5/y'2 0 / 



If the column of zeros is dropped, the standard matrix representation of 
■0 is a two-component spinor, as familiar from the usual nonrelativistic 
Pauli theory. Such spinors carry an irreducible representation of the 
rotation group SU (2) ? It may be noted that the full rotor R can always 
be recovered from the ideal projection RP 3 as twice its even part: 



R = 2{RPs)^ . 

The term is a scalar and tr can be omitted from (39). Although 

we derived the spin distribution as a classical expression, it has pre- 
cisely the quantum form if we recognize that the matrix representation 
of the unit vector m, namely m = m^ej m/ax + m/ay -\- m/az , is 
traditionally (but misleadingly, since it represents a vector, not a scalar) 
written m • cr . From the definition of Pi-ef the spinor if satisfies the usual 
normalization condition, 

2 J d/x (ip^ip'^ = (V'1'0) = 1 

and the average component of the spin in the direction m is 

2 J d^x (^ip^mif^ ={if\m\if). 

From expression (39) we see that the real paravector Pg = i?P+ 3 i?t = 
^ (1 -|- s) embodies information about the classical spin state at a given 
point in space or in a homogeneous ensemble. It is equivalent to the 
quantum spin density operator g ~ ifif^ for the pure state of spin s. 



^More generally, once projected onto a minimal left ideal, any spinor df = p//f BR is equiv- 
alent to a dilated spatial rotation, thereby reducing this representation of the noncompact 
Lorentz group to a scaling factor times the compact group SU (2) . 




142 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



and it is also a projector that acts as a state filter. The component 
of the spin in the m direction is 2(Psm)_g, whose matrix representa- 
tion is identical to the usual quantum expression, traditionally written 
^tr {fjm • it} . Note that the part of a rotation R around the spin axis 

becomes a phase factor of V’ = since the pacwoman prop- 
erty gives P+3 = e“*^/^P+3 . Thus, when phase factors of ■0 are 

ignored, information about the axial rotation of R is lost. Good quan- 
tum calculations keep track of relative phase, and this appears to be the 
only aspect of the phase (rotation about 63 ) that can be determined 
experimentally. 

One way of seeing whether the system is in a given state of spin n 
is to apply the state filter to the spin density operator q and see what 
remains: 

Pn^'Pn = (Pn£» + ^Pn) Pn = 2 {PnQ)s Pn • (40) 

The scalar coefficient 2 (Pn^>)_5 = ((1 -|- n) is the probability of finding 
the system described by q in the state n . For a system in the pure state 
^>=Ps = ^(l-|-s), the probability is 

2 (PnPs)s = ^ ((1 + “) (1 + ®))5 = ^ (1 + n • s) • (41) 

This is unity if the system is definitely in the state n, whereas it vanishes 
if the system is in a state orthogonal to n . Thus, s = n is required for 
the states to be the same and s = — n for the states to be orthogonal. 
Note that the mathematics is the same as used in my first lecture to 
describe light polarization. [1] 

5.2 Spin I and State Expansions 

The value of ^ for the spin of elementary spinors considered here arises 
in several ways. It is the group-theoretical label for the irreducible spinor 
representation of the rotation group SU (2) carried by ideal spinors. It 
is also required by the fact that any rotation can be expressed as a linear 
superposition of two independent orthogonal rotations defined for any 
direction in space. The Euler-angle form (38) of any rotor R can be 
rewritten 



R = exp (-zn6*/2) exp [-ze3 {(f) -h x) /2] (42) 

( 6 6 \ 

cos - - in sin - j exp [-ie3 (0 -h x) /2] , 

where n = exp (— ie3i/)/2) e2 exp (ie3i/)/2) is a unit vector in the eie2 
plane. Therefore, any rotor i? is a real linear combination cos -|- 
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sin of rotors R'l = exp [—^63 {(f + x) /2] and Ri = — ini?| that are 
mutually orthogonal: = (— in)^ = 0 . 

By projecting the rotors with P_|_3 we obtain the equivalent relation 
of ideal spinors: 



6 6 

RP +3 = cos -R^P +3 + sin --R^P +3 

9 9 

/ = Prei ^P+3 = COS -V'T + sin (43a) 

V'T = Prf e"*('^+^^/^P+3, = -inV't • 

Traditional orthonormality conditions hold: 

2 = 2 ^ = 2pi.ef (P+a^n)^^ = 0 

2 = 2 = 2pref (P+3)5 = Pref • 

It follows that the amplitudes are 

{///) = 2 J d^x (V’V’I)^ = cos ^ 

{///)= 2 j d^x = sin^ 

giving probabilities as found above in Eq. (41). 

KV’tIV’)^ =cos^^ = ^(1 + 8 - 63 ) (44) 

KV’tlV’)^ =sin2^ = i(l-s-e3). (45) 



By tacking on an additional fixed rotation, the treatment can be ex- 
tended to measurements along an arbitrary axis. 



5.3 Stern-Gerlach Experiment 

The basic measurement of spin is that of the Stern-Gerlach experiment 

[10] , in which a beam of ground-state silver atoms is split by a magnetic- 
field gradient into distinct beams of opposite spin polarization. It is a 
building block of real and thought experiments in quantum measurement 

[ 11 ] . 

Consider a nonrelativistic beam of ground-state atoms that travels 
with velocity v = Ua;ei through a static magnetic field B that vanishes 
everywhere except in the vicinity of the Stern-Gerlach magnet. The net 
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effect of the magnet on the beam is a vertical force proportional to the z 
component Hz of the magnetic dipole moment, which we take to be the 
magnetic dipole moment of an electron (27), corresponding to an atomic 
ground state with an unpaired electron in an S state: 

dBz dBz 

We assume the beam is uniform along its length with pref = 1 and that 
dBzjdz < 0. The eigenspinor for the initial beam in the nonrelativistic 
limit is then 

= BR ~ R . 

The key point is that, as seen above (42), every rotor is a linear combina- 
tion of “up” and “down” rotors with opposite spin directions. For a state 
in which s makes an angle 6 with respect to 63, R = cos |ii| -|- sin 
with 

R]e3R\ = es, RiesR\ = -e^ . (46) 

The components of the spin unit vector 

se3 = (fle3flte3)^ = | +1; 

are also opposite for the rotors R^ and R^ so that the Stern-Gerlach mag- 
net forces the eigenspinor components R-^ and R^ of the beam eigenspinor 
upwards and downwards, respectively. This splits the incident beam into 
two isolated branches, analogous to the way a birefringent crystal splits a 
beam of light into two branches of orthogonal polarization. The fraction 
of the initial beam in the upper branch is cos^ 0/2 = ^(l-|- cos 0 ) whereas 
in the lower branch the fraction is sin^ 0/2 = |(1 — cos 6 ) , just as found 
from the square amplitudes (44) and (45) above. The two-valued prop- 
erty of the measurement is a direct result of the decomposition of any 
rotor R into rotors for rotations about any two opposite directions, ro- 
tations that correspond to “spin-up” and “spin-down” components. 

These relations can be re-expressed in the form of eigenspinor ideals, 
which is closer to the quantum formulation. Since the rotors R-^ , R^ are 
unitary, relation (46) can be cast in a form 

e^R^ = R^G3, e^Ri = —Rie^ 

that becomes an eigenfunction equation when projected onto the P+3 
ideals = R 11 F 3 : 



esV'T = = -V’t • 
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Evidently 63 =P3— P3 is the spin operator corresponding to the z com- 
ponent 8-63 of the spin in units of /i /2 for the P3 ideal representation 
of The projected rotor is split into RP3 = / = cos |' 0 i' -|-sin the 
two parts of which experience opposite forces from the Stern-Gerlach 
magnet. 

6. Bell’s Theorem 

Quantum systems are often tested against Bell’s theorem. Violation 
of Bell’s inequalities is generally held to be evidence of nonlocal quan- 
tum weirdness. A simple proof of the inequalities can be derived from 
polarization splitters acting on a single beam. The nice thing about this 
derivation is that the question of locality vs. nonlocality does not arise. 
It shows that Bell’s inequality concerns directly the difference between 
classical probabilities for particles and quantum probability amplitudes 
in waves. Nonlocality arises only in EPR-type applications with entan- 
gled states, but the inequality itself is violated locally for a single beam 
of waves. 

Consider a beam of silver atoms passing through a sequence of Stern- 
Gerlach magnets aligned to split the beam into spins polarized in op- 
posite directions specified by unit vectors ±a, ±b, ±c. If one of the po- 
larized beams leaving a given Stern-Gerlach magnet is blocked, that 
magnet serves as a polarization filter. Let /(c, b,a) be the fraction 
of atoms passing through sequential polarization filters of types a, b, c. 
Then, with “classical” atoms (the same math works for polarized pho- 
tons), the fraction that pass through filters a and then c is just the sum 
of those that pass through a, b, c and a, — b, c : 

/ (c, a) = / (c, h,a) + f (c, -b, a) . 



Similarly, 



/ (-C, b) = / (-C, h,a) + f (-C, b, -a) 

/ (c, h,a) + f (-C, b, a) = / (b, a) . 

Adding these three relations we obtain 

/ (c, a) + f (-C, b) = / (b, a) + f (c, -b, a) + f (-c, b, -a) 

>/(b,a) (47) 

which is a form of Bell’s inequality. We found above (41) that / (c, a) = 
^ (1 -|- c • a) so that Bell’s inequality (47) can be written 



l-|-ca — cb>ba. 
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If we let b bisect a and c, so that we can put c • a = cos 29 and 
c • b = b • a = cos 0 , we obtain 

1 + cos 26 >2 cos 6. 

However, this inequality is broken for 9 = vr/4, for example. The error is 
in working with fractions or probabilities instead of amplitudes. Using 
our filter method (40), we find indeed that 

/ (c, a) = 2 (PcPaPc )5 = ^ (1 + a • c) 

= 2 (Pc (Pb + Pb) Pa (Pb + Pb) Pc)^ , 

but this is not the same as 

/ (c, b, a) + / (c, -b, a) = 2 (PcPbPaPbPc)^ + 2 (PcPbPaPbPc)^ 

= (PcPbPc)^ (1 + b • a) 

+ (P cP bP c)5'(l~b-a) 

= i (1 + b • c b • a) 

unless b = a or b = c. 

7. Qubits and Entanglement 

The basis of quantum computation [12] lies in the replacement of the 
classical bit, with its discrete binary values ( 1 , 0 ) [for (on, off) or (true, 
false)], by a two-level quantum system, called a quantum bit or quhit^ 
that can generally exist in a superposition of its two states. The unitary 
evolution of a coupled system of qubits in an arbitrary superposition 
of states simultaneously computes the evolution of all binary sequences 
in the superposition, and this massive quantum parallelism offers huge 
advantages for certain problems, such as factoring large numbers and 
some types of large searches. 

The simplest example of a two-level quantum system is the spin-1/2 
system considered above. The same formalism can be applied to qubits. 
Here, we mainly formulate the algebra in the minimum left ideal C^sPs 
of APS and tensor products thereof, since these lie closest to the quan- 
tum formalism. We first consider operations on single spin systems in 
the language of qubits and quantum computing, and next present some 
extensions to systems of two or more qubits. 

7.1 Qubit Operators 

There is arbitrariness in the phase as well as in factors of 03 in 
and V’j,- Let us therefore define a standard ideal spinor basis, which we 
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IT) = P3 

li) = ei IT) = eiPs IT) = eiesPs |T) = -iesPs IT) • 



The unipotent (ef = 1) element ei interchanges these: ||) = ei |T) and 
IT) = IT) • We may think of the “up” state |T) as representing the bit 
value 1 , and the “down” state |T) as the bit value 0 . Evidently, ei acts 
as the NOT operator. It is also a reflection operator, since for any vector 

V 

V ^ eivei = — v*“ = 2v • ei ei — v 

keeps the component of v along ei constant and reverses components 
perpendicular to ei. The dual to ei is — iei = exp (— i7rei/2) , a rotor 
for the rotation of tt about ei. If it is applied twice, the result is a 27 t 
rotation which changes the sign of any spinor for a spin-1/2 system: 
(— iei)^ = — 1 . Note that any unit vector in the ei2 plane can serve 
as the NOT operator. The resultant states differ only in their relative 
phase. 

A state in which the spin is aligned with ei is an eigenstate of the NOT 
operator. Such a state is obtained by applying the rotor exp (— i7re2/4) 
to P3, which rotates the spin from 03 by tt /2 in the 6361 plane. It can 
be expressed in several ways: 



g-i7re2/4p^ 



/ 1 - ie2 

V v/2 



P3 



1 + ei 

“TT 



es + ei \ 

v/2 J 



P3 . 



In the fourth form, we see the eigenstate of the NOT operator is also 
obtained by a reflection of the up basis state in the unit vector h = 
(ei 4-63) /\/2 = = €“*’^^2/^63 . This reflection is called the 

Hadamard transformation. 

Since any state can be expressed as a linear combination of the basis 
states P3 and eiPs, we can find the effect of an operation on any state if 
we know its effect of the basis states. In the Hadamard transformation. 



lt)-h|T) 

li)-h|T) 



IT) + U) 

v/2 

hei IT) = 



IT)-Ii) 

^/2 



7.2 Exponential Forms 

Rotations can be written in exponential form with a scalar parame- 
ter proportional to the angle of rotation. They are thus connected to 
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unity (scalar parameter = 0) and can easily be interpolated. Reflec- 
tions, on the other hand, are usually viewed as discrete transformations, 
performed entirely or not at all. However, unit vectors can also be 
extended to exponential form. Such exponential forms are useful for 
working out physical interactions in the Hamiltonian that induce the 
reflections through the time evolution operator. 

Consider first the dual —in of the real unit vector n. It is a rotor 
for a rotation by angle tt about the n axis and can be viewed as the 
continuous transformation exp (— ina/ 2 ) when a = tt : 

—in = exp (— ina/ 2 ) , a = n . 

Thus the unit vector n can be expressed 

n = iexp (— in7r/2) = exp [i (1 — n) tt/2] 

= exp [iPnTr] = exp [iPno] , a = tt . 

In other words, it is part of a continuous transformation generated by 
the projector P„. At intermediate angles a we note that 

giaP ^ giaP ('p ^ ^ p ^ giap _ 

7.3 Raising and Lowering Operators 

We saw above that in the ideal-spinor representation, the unit vector 
n represents the spin component in the direction n in units of h/2. One 
may verify that the commutation relations of unit vectors in space are 
just what one needs for the spin components, and of course that is why 
the Pauli spin matrices were introduced to represent electron spin. The 
operator for the squared spin is thus 




The raising operator is defined by ||) = ||) , •s+||) = 0. Any 
non- null element that annihilates P 3 = ||) must have the form XP 3 , and 

xh U) = xPseiPs = xei ||) = ||) 

is satisfied by x = ei. Thus, we can put 

s+ = eiPs = ^{ei+ ie2) = P3ei . (48) 

Similarly, the lowering operator is 

s_ = eiPs = 4 



( 49 ) 
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The operators are examples of nilpotent operators: = 0 . Their 

products are 

s+s- = P3, s_s+ = P3 , 

which gives + s_s+ = 1 and = 63 . 

7.4 Two-Qubit Systems 

We represent multi-qubit systems very generally as tensor products 
of single qubit systems, that is, in tensor products of ideals of APS: 
(8> C^sPs- This representation is sufficiently general to handle dis- 
tinguishable quantum systems. If the systems are identical, additional 
symmetries apply. Tensor (or “Kronecker”) products obey the funda- 
mental relation 

(a <Sib) {c<Si d) = ac<Si bd 

Label two qubits A and B. With two qubits or spin- 1/2 systems, the 
projector is the tensor (or “Kronecker” ) product P3(8>P3 and the general 
pure state can be written 

/’AB = 'Lab (P3 <S> P3) • 

If 'k AS is a product state, it can be expressed as a single product term 
'Lab = 'La <8> 'I'b so that 

/’AB = 'Lab (P3 P3) = ('I'a < 8 ) 'ks) (P3 P3) = 'kAPs <S> 'ksPs 

and the density operator is also a single product 

QAB = /abiPab = (P3 <S> P3) ('I'a < 8 ) 'ks)'*' 

= (^'A ^b) (P3 P3) (^A ^b) 

= (^aP3^^) ® (^bP3^L) 

= QA® QB ■ 

Such systems, whose density operators can be expressed as a single ten- 
sor product of subsystem parts, are decomposable. In a decomposable 
system, measurements in one subsystem are independent of those in an- 
other. There is no entanglement and indeed no correlation between the 
subsystems. A system whose density operator is a mixture of decom- 
posable density operators 

QAB = X] ® 6 b 

j 
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is called disentangled or separable. An entangled state is not separable. 

A redueed density operator for one subsystem of a composite system 
is found by summing over all the states of the other subsystems; this is 
equivalent to tracing over the other subsystems. We can write 

Qab = trfi {qab} 

for the reduced density matrix of subsystem A after summing over all 
states of subsystem B. In the case of a product state qab = Qa'Si Qb-, 

Qab = {qa < 8 ) qb} = Qa® trs = Qa ■ 

It is often stated that measurements in one subsystem affeet the out- 
come of those in another, but such language is misleading and based on 
classical probability interpretations. It suggests a cause- and-effect rela- 
tion that propagates faster than the speed of light, and such propagation 
is not physically possible by relativistic causality. In an entangled sys- 
tem, if measurements on subsystem A are only recorded when a given 
measurement result is obtained on subsystem B, then the results on A 
will depend on the measurement outcome selected on B. However, if re- 
sults are recorded for A regardless of the measured value for B, then the 
A results cannot depend on the actual measurement performed on B. 
Mathematically, the density operator for A in that case is the subtrace 
of Q over B, and that subtrace is independent of basis and therefore 
independent of which property eigenstates are used. 

Arbitrary states, including entangled (and hence inseparable) ones, 
can be expressed as linear combinations of product states: 

AB (P3 ® P3) = ^ Cjk ['I'a i^j) ® ^B (P3 ® P3) • 
j,k 

The corresponding density operator is 

QAB = '^AB (P3 <s> P3) '^^B ■ 

The average value of spin A in the direction a and, simultaneously, spin 

B in the direction b is 2 ^ ( qab^ < 8 > b ) where the subscript S ® S 

\ / s®s 

indicates that the scalar part is to be taken in the spaces of both spin 
A and spin B. Similarly, the probability that spin A, upon appropriate 
measurement, is found in state Pg^ and spin B is found in state Pg^ is 

2 ^ (^?ABPsA<^PsB)5',g,S' • 

One way that linear superpositions of product states arise is in reduc- 
tion of representations of the rotation group. If we rotate '^ab by an 
additional rotation with rotor R we get 



'^AB R® R'^ab 
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and the rotation factor R 0 R is a product representation of SU (2). 
For example, suppose we rotate each spin by vr about the axis a. Then 
R = exp (— ia7r/2) = — ia and R<Si R = —a (8> a. 

The rotation factor R<Si R can be reduced to a direct sum of spin-1 
and spin-0 parts: 

R(^Rc^UR(^RU^ = 



where the elements of the 4x4 unitary matrix U are Clebsch-Gordan 
coefficients {S M\simiS 2 m 2 ) ■ The reduction entangles states that con- 
tribute to more than one irreducible representation. In particular, the 
S,M = 1,0 and S,M = 0,0 states are fully entangled. Consider the 
spin-0 part: 



( 0 ) _ 
AB — 



T(l®e, 



ei (g) 1) (Ps ©Ps) 



with a spin density 



8 % = V’abV'ab = ^ (1 - ei © ei) (P3 © P3 + P3 C P3) • 



Note that V'ab singlet is antisymmetric under interchange, and 

is symmetric. One can also verify that V'ab invariant under 
rotations: 8^ab idempotent and 

may be viewed as a 2-spin (or 2-qubit) projector. Thus, represents 
a pure state. Fully entangled states are pure. However, mixed states 
can also be partially entangled. 

Consider the triplet state with Mg = 0. This is easily found within a 
normalization factor by starting with the (S,M) = (1, 1) state 



V;Pd) =P3^P3 



and applying the lowering operator [compare (49)] 5_ = eiPs © 1 -|- 1 © 
eiPs : 






(bo) 

AB 



(bO) 

8ab 



^ (1 (g) ei -b ei (g) 1) (P3 (g) P3) 

V’ab ^ (1 -b ei (g) ei) (P3 (g) P3 -b P3 © P3) • 



Note ^ (1 -b ei (g) ei) and ^ (1 — ei (g) ei) are complementary 2-spin pro- 
jectors. 

A mixture of equal amounts of fully entangled (and hence nonsepara- 
ble) singlet and triplet states with M = 0 gives the state 



" =2 



( 0 ) , ( 1 , 0 ) 

8ab + 8ab 



= - (P3 <g> P3 + P3 C) P3) 



(50) 
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This is a separable state, that is a mixture of decomposable (uncorre- 
lated) states. Through the mixture, the two spins do become correlated, 
but not entangled. The correlation is purely classical. 

7.5 Total Spin and Nonlocal Projectors 

However, we can project an entangled state out of the separable one 
( 50 ) by applying a nonlocal projector. Consider the singlet-state projec- 
tor for the two-spin system: 

P 5=0 = 1 - 

where 

1 ^ 1 

52 = ^ = - ^ (ej ® 1 + 1 O ejf = - ^ (1 + ej ® e^) 

j i=i j 

is the square of the total spin operator. Substitution of 5 ^ into P5=o 
gives directly the isotropic form 

P 5 =o = 1 - ^ [3 -h ei (g) ei -h 62 (8) 62 -h 63 (g) 03] 

= ^ [1 - (61 (g) 61 -h 62 (g) 62 -h 63 (g) 63)] . 

Applying Ps’=o to g' ( 50 ) and noting that, with applications of the pac- 
woman property, 

P5=o (P3 <g) P3 + P3 C) P3) = 2 (^3 <g) P3 + P3 C) P3) 

we find 

P 5 =o^^Ps=o = Q [1 “ ei] (P3 (g) P3 -I- P3 (g) P3) [1 — 61 (g) 61] 

O 

= ^ [1 - 61 (g) 61] (P3 (g) P3 -h P3 (g) P3) , 

which is just the singlet contribution to g' ( 50 ). Note also that 

P 5 =i = 2*^^ = ^ “ P 5 =o 
is the projector onto the triplet state, and 

T3 /T3 1 ( 1 . 0 ) 

Ps=iqPs=i = i^Qab ■ 

The spin projectors P5=o and Ps’=i are examples of nonlocal projectors 
that can project unentangled states into entangled ones. 
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8. Conclusions 

We have derived many quantum results for a classical system. The 
quantum/classical interface has become almost transparent in the spinor 
approach of Clifford’s geometric algebra. Quantum effects demand am- 
plitudes, and these are provided classically by rotors of the geometric al- 
gebra. In particular, the eigenspinor is an amplitude closely associated 
with quantum spinor wave functions. Linear equations for the classical 
eigenspinor suggest superposition and hence the possibility of quantum- 
like interference. Projectors onto minimal left ideals of the algebra are 
needed to reduce the spinor representation of Lorentz rotations in the 
algebra and to simplify the equation of motion, and by using them, many 
of the classical results take quantum form. The association helps clarify 
various quantum phenomena, and extensions to multiparticle systems 
promise to demystify such quantum phenomena as entanglement. The 
Q/C relation appears to be much deeper than superficial. The basic 
spinor representation gives spin 1/2 and changes in sign for rotations of 
360°, and the linearity of the equation of motion gives g = 2. 

While our results are concordant with most of quantum theory, there 
are some important differences in interpretation. In particular, fermion 
spin is given a classical foundation. Although quantum texts often give 
the magnitude of the spin as \/3^/2, there is no indication of any length 
greater than /i/2 in the classical approach. Since the projection of the 
spin in any direction is /i/2, a larger magnitude would process about the 
quantization axis. Such a precession should have observable effects, but 
none has evidently been seen. The classical approach indicates that the 
prediction of a larger size is erroneous, arising from a misunderstand- 
ing of the spin component operators in an ideal spinor representation. 
A related statement from conventional quantum theory is that only one 
component of the spin can have a definite value at a time. This is usually 
justified by the uncertainty relation, which predicts uncertainties in the 
X and y components of the spin in any eigenstate of the z component. 
The classical approach suggests that in such an eigenstate, the x and y 
components vanish and are therefore definite. However, any such state 
is a linear combination of spin up and down along ei and any measure- 
ment of the X component forces the system into one of these states. The 
uncertainty relation is therefore correct but its interpretation question- 
able. The classical interpretation is that a spin direction s exists for each 
pure state, and this is reinforced by the spin density operator approach 
in quantum theory. The spin density operator for a pure quantum state 
is ^ (1 + s) ) which indicates a spin direction s identical to 

Re^R} in the classical approach. Although g cannot be determined by 
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a single measurement, it can be found by a set of measurements on dif- 
ferent samples of a pure-state ensemble, in complete analogy with the 
process of measuring the Stokes’ parameters in a fully polarized beam of 
light. The mathematics is in fact identical to the treatment of polarized 
light and our interpretation should probably be analogous. 

The quantum age has started and quantum technology is on its way. 
It’s time to shift gears into the paradigm of relativistic quantum theory, 
on which the quantum age is based. APS formulates mechanics in a 
fashion that naturally incorporates relativity and much of the structure 
of quantum theory. It seems destined to play a central role in the revision 
of undergraduate curriculum needed to complete the paradigm shift. 
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Abstract In this work we develop the family of Prometheus orthonormal sets 
(PONS) in the framework of certain abelian group algebras. Classical 
PONS, introduced in 1991 by J. S. Byrnes, will be constructed using 
group algebra operations. This construction highlights the fundamental 
role played by group characters in the theory of PONS. In particular, 
we will relate classical PONS to idempotent systems in group algebras 
and show that signal expansions over classical PONS correspond to 
multiplications in the group algebra. 

The concept of a splitting sequence is critical to the construction 
of general PONS. We will characterize and derive closed form expres- 
sions for the collection of splitting sequences in terms of group algebra 
operations and group characters. 

The group algebras in this work are taken over direct products of the 
cyclic group of order 2. PONS leads to idempotent systems and ideal 
decompositions of these group algebras. The relationship between these 
special systems and ideal decompositions, and the analytic properties 
of PONS, is an open research topic. A second open research topic is the 
extension of this theory to group algebras over cyclic groups of order 
greater than 2. 
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1 . Introduction 

PONS, the Prometheus Orthonormal Set, has undergone considerable 
refinement, development, and extension since it was introduced in [6]. Its 
application as an energy spreading transform for one-dimensional digital 
signals is discussed in [10], with further details of this aspect presented 
in the patent [9]. The construction of a smooth basis with PONS-like 
properties is given in [8] , thereby answering in the affirmative a question 
posed by Ingrid Daubechies in 1991. A conceptually clear definition of 
the PONS sequences that comprise the original symmetric PONS matrix, 
via polynomial concatenations, is also given in [8]. An application to 
image processing of a preliminary version of multidimensional PONS 
can be found in [7]. An in-depth study of multidimensional PONS is 
currently being prepared. Proofs of the results given below will appear 
elsewhere. 

2. Analytic theory of one-dimensional PONS 

First we provide an account of some of the most basic mathemati- 
cal concepts and results on one- dimensional PONS sequences and the 
related PONS matrices. 



2.1 Shapiro transforms of unimodular sequences 



The whole mathematical theory of the PONS system, and also its ap- 
plications to signal processing, turn out to derive from one fundamental 
idea, that of the Shapiro transform of a unimodular sequenee. We begin 
by describing it. 

Let {aQ,a\,a 2 , ■ ■ ■) be any infinite sequence of unimodular complex 
numbers. Then a sequence {Pm, Qm) of polynomial pairs (with unimod- 
ular coefficients and common length 2™) is inductively defined as follows: 
Po{x) = Qo{x) = 1, and: 



/ Pm+l(2^) — Pm{x) -|- eXmX Qmix} 
\ Qm-i-l{x) — Pm{x) C^mX Qm{x) 



for all integers m > 0. (1) 



Since Pm{x) is a truncation of Pm+i{x) for every m > 0, it follows 
that there is an infinite sequence {(do, fdi, (^ 2 , ■ ■ ■) of unimodular complex 
numbers which only depends on the given sequence (cco; cii, CC 2 , • • •) and 
such that for each m > 0 the first polynomial Pm of the pair {Pm,Qm) is 
always the partial sum of length 2™ (i.e., of degree 2”^ — 1) of the unique 
power series YlT=o/^kX^- Note that such a property does not hold for 
the polynomials Qm, since Qm is not a truncation of Qm+i- 

The explicit construction of the Shapiro transform {l3k)k>o ™ terms 
of the original unimodular sequence {ctm)m>o follows: Let k = 
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Z^r>0 ■ 2^ denote the expansion of an arbitrary integer A: > 0 in base 

2 (so that the binary digits^' Sr take only the values 0 and 1, and 5^ = 0 
for all r > log /c/ log 2). Then we have 



A = n 

r>0 

where 

€k = (— (the classical Shapiro sequence [21]). (3) 

We call (/3fc)fc>o the Shapiro transform of the sequence {am)m>o- In 
particular, when am = 1 for all m > 0, we have (3k = (the classical 
Shapiro sequence). 

The Shapiro transform power series" and the related 

polynomial pairs (PmiQm) have, respectively, all the remarkable prop- 
erties of the classical Shapiro power series YlT=o and those of the 
classical Shapiro polynomial pairs. (We will recall these properties in the 
following section, on the “original PONS matrix”). In addition, all the 
unimodular complex numbers ao,ai,a 2 ,--- are at our disposal, which 
can be useful in many situations. For our present purposes (PONS con- 
structions) the parameters ao,ai, 02 , ■ ■ ■ will only take the values ±1. 
But the case when am = ±1 or ±i can also be useful in signal process- 
ing (and leads to PONS-type Hadamard matrices with entries ±1, ±i). 
Other choices of the unimodular parameters ao, ai, a 2 , ■ ■ ■ have other 
interesting applications that will be dealt with elsewhere. 

2.2 The original PONS matrix of order 2"* 

Before giving, in section 2.4, several (essentially equivalent) definitions 
of PONS matrices in full generality, and indicating structure theorems 
for such general PONS matrices, we start by recalling the original PONS 
matrix constructed in 1991 and published in 1994 [6]. Indeed, the proofs 
of structure results for general PONS matrices make use (in the inductive 
arguments) of properties of this original matrix of order 2™, which we 
will denote by (There is no risk of confusion with the previously 

defined polynomial Pm)- 

The original way of defining the P 2 ™ uses an inductive method based 
on the eoneatenation rule: 





( ^ 


S \ 


M A 


A 


-B 


V ^ J 


B 


A 






-a) 



( 4 ) 
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where A and B are two consecutive matrix rows. More precisely, we 
start with the 2x2 matrix 




(where, henceforth, we write + instead of +1 and — instead of —1, to 
ease notation). Thus the two rows of P2 are A = (++) and B = (H — ). 
Then the rule ( 4 ) means that the first row of the next matrix, P4, is 
the concatenation of A and B, which here is (+ + H — ); the second row 
of P4 is the concatenation of ^ = (++) and —B := ( — h), which is 

therefore (+ H h); the third row of P4 is the concatenation of B and 

A, i.e., (H 1 "+); and finally the fourth row of P4 is the concatenation 

of i? = (H — ) and —A := ( ). Thus 



Pa := 



+ + 
+ + 
+ - 
+ - 




To obtain the next matrix Pg, we first take the pair A, B to be the first 
two rows of P4 (in that order) and use the concatenation rule ( 4 ) to 
obtain the first four rows of Pg. Then we take the pair A, B to be the 
next two rows of P4 (in that order), and use the concatenation rule ( 4 ) 
to obtain the next four rows of Pg. Thus 



/+ + + - + + -+ \ 

+ + + -- - + - 

+ + - + + + + - 




Similarly, the first two rows of Pg and the concatenation rule ( 4 ) yield 
the first four rows of Pie, and so on. Already, from this definition, one 
can deduce many fundamental properties held by P2™, (m = 1 , 2 , 3 ,.. .). 

We list some of them and indicate later on that many of these prop- 
erties also hold for the most general PONS matrices that we shall define 
and characterize below. In addition, some of these properties (for exam- 
ple, the extremely important bounded crest-factor property for all finite 
sections) will also be seen to be valid for some very broad classes of 
Hadamard matrices generalizing the PONS matrices. 
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We note that, in the context of codewords, and without reference 
to mathematical properties discussed herein, or to the original Shapiro 
polynomials [21, 20], a basically identical matrix to Pp was constructed 
by Welti [25]. Several additional authors [13, 14, 23, 26] have also re- 
ported on these Welti codes and their application to radar, and others 
[5, 3, 4, 17, 19, 15, 18, 16] have discussed communications applications 
of similar constructions. 

2.3 Some properties of the original PONS 
matrix Pj,, (X = 2”*) 

As we said, most of the properties below will be seen to hold for the 
most general PONS matrices (that we shall define later, in section 2.4) 
and even for some very broad generalizations of PONS. 

Property 1 Pp, with L = 2™, is a Hadamard matrix of order 2™. 

Property 2 Suppose the rows of Pp are ordered from 0 to L — 1 (i.e., 
the first row has rank 0 and the last row has rank L—1). Denote by Ar{z) 
the polynomial “associated” to the r-th row (i.e., Ar{z) = X]fc=c) 

(ao,oi, . . . ap-i) denotes the r-th row, r = 0, 1, . . . L—1). It is well known 
[2] that, with this notation, A\{z) = {—1)'^^^Aq{—z) where A*{z) = 
2 ;deg"^A(l/ 2 :) denotes the “inverse” of the polynomial A{z) . This is a fa- 
mous identity on the classical Shapiro pairs. Property 2 is that a similar 
identity A2r+i{z) = Xm,rA2r{—z) holds for all r = 0 , 1 ,... L/ 2 , where 
Xm,r is an extremely interesting number (with values PI) expressible in 
terms of the “Morse sequence”. The Morse sequence is the sequence 
of coefficients in the Taylor (or power series) expansion of the infinite 
product n^o “ z"^'"). 

Property 3 With the previous notation, for every r = 0, 1, . . . L/2 the 
polynomials A 2 r{z) and A 2 r+i{z) are “Fejer-dual” (or “dual” for short), 
that is, 

\A 2 r{z)'\^ + \ A 2 r-\-i(z)\'^ = Constant (= 2L, in this case) (5) 

for all z € C with \z\ = 1. Equivalently, the {2r)-th row and the (2r-|-l)- 
st row are always “Golay complementary pairs” [12]. 

Property 4 [Much related to Properties 2 and 3] Every row-polynomial 
Ar{z) is QME, that is, 

\Ar{z)\‘^ + \Ar{—z)\'^ = constant (= 2L in this case) 



( 6 ) 
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Property 5 (The “splitting property” of rows) For every r = 
0, 1, . . . ,L — 1, the two “halves” of the row-polynomial Ar{z) are dual, 
eaeh of these two halves has dual halves, eaeh of these halves (i.e., “quar- 
ters” of Ar{z)) has dual halves, and so on. This “splitting property” is, 
by far, the most important property, in view of its applieations to “en- 
ergy spreading” . It extends to general PONS matriees and to a broader 
elass of PONS-related Hadamard matriees that we will eonsider later. 

Property 6 (The “QMF-splitting property”) This is a finer 
form of the “QMF property” (Property 4) and an analog of the “splitting 
property” just deseribed. We will postpone its definition until we eome 
to the strueture results for general PONS matriees (in seetion 2. 4). 

Property 7 (The “Hybrid splitting property”) This is also 
a finer form of the “QMF property (above Property 4), and is as follows: 
Every row-polynomial Ar{z) is QMF, i.e., (6) holds; and if we split Ar{z) 
into two halves of equal length, then eaeh of those halves is QMF. If we 
split these halves into halves of equal length, these in turn are QMF, and 
so on. We will return to this property in seetion 2.4 when we deal with 
the strueture results for general PONS matriees. 

Property 8 (The “gonstant row-sums property”) Ifm is even, 
then eaeh row- sum of Pl (with L = has the eonstant value ^/L = 
2 m/ 2 _ jjffi ig then the row sums are either zero or \/2L = 

This property is important but easy to eheek. However, this is a very 
speeial ease of the deep (and still partly open) problem of the values of 
row- polynomials at various roots of unity. 

Property 9 (“Bounded grest factor properties”) Not only is 
it true that every row-polynomial have erest faetor < \f2, but also every 
finite seetion of sueh a polynomial has erest faetor not exeeeding some ab- 
solute eonstant C (the optimal value of whieh we have determined). We 
point out that these extremely important properties (for “energy spread- 
ing”) are elosely related to Property 5 (splitting properties of rows). 

Property 10 (“Correlation properties”) These are diseussed in 
seetion 2.6. 

Property 11 (“Dual-companion uniqueness”) Every row has ex- 
actly one “eompanion row”, that is, if Ar{z) and Ag{z) denote the as- 
soeiated row-polynomials, then |H^(e**)| = |Hs(e**)| for all t G M. These 
two rows are “mirror images ” of eaeh other, exeept for a possible mul- 
tiplieation by —1. This possible sign ehange, and also the distribution 
of the loeation of s in terms of r, are quite surprising and ean be ex- 
pressed, here also, in terms of the Morse sequenee. As a eorollary, every 
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row has exactly two “duals” (or, equivalently, two rows which are Golay- 
complements to it [12]). 

2.4 General definition of PONS matrices 

In section 2.3 we described the original PONS matrix and some of its 
properties. Before considering any other special PONS matrices (such 
as, typically, the symmetric PONS matrices the very existence of which 
is really surprising), we will give a general definition of PONS matrices 
and also consider some of their useful generalizations. It is convenient 
to start by considering three very broad classes of finite sequences of 
length 2"^. These have an independent interest, with or without regard 
to Hadamard or PONS matrices. 

So, let S = (oo, ai, . . . a^-i) denote any complex- valued sequence of 
length L = 2"^, {m >1). Its “associated polynomiaP (or “generating 
polynomial) is 

L-l 

P{z) ■= '^akz’'. 

k =0 

The two “halves” of P{z) are: 

L/2-1 L-l 

A{z) = OkZ^ and B{z) = a^z^. 

k =0 k = Lj 2 

Definition 12 The sequence S is said to have the “splitting property” 
(or to he a “splitting sequence”) if its generating polynomial P{z) has 
“dual” halves, that is, 

L-l 

|yl(e**)p -|- |il(e**)p = constant ( = ||D ||2 = |ofcP) necessarily), 

k =0 

and if each of the halves A{z) and B{z) has dual halves, and so on. 

Definition 13 The sequence S is said to have the “QMF splitting prop- 
erty” (or to he a “QMF splitting sequence”) if, first of all, it is QMF, 
which means that P{z) and P{—z) are dual, or, equivalently, that the 
even-index component C{z) := J2k=o ^ ^ 2 kZ^ and the odd-index compo- 
nent D{z) := Y[k=o ^ ci 2 k+iz^ are dual; and if, in turn, both of C{z) and 
D(z) are QMF; and so on. 

Definition 14 The sequence S is said to have the “hybrid splitting 
property” (or to be a “hybrid splitting sequence”) if it is QMF (i.e.. 
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P{z) and P{—z) are dual), and if it has QMF halves (i.e., A{z) and 
A{—z) are dual and also B{z) and B{—z) are dual), and if eaeh of the 
halves has QMF halves, and so on. 

We point out that these are three (overlapping but) pairwise distinet 
classes, even if S is assumed to only take the values ±1 (as long as 
L > 16). The smallest L = 2”^ for which one can find examples of ±1 
sequences of length L satisfying one of the above conditions and not 
the other two is precisely L = 16. However, we have the following two 
theorems: 

Theorem 15 For any integer m > A, there are Hadaniard matriees of 
order L = 2”^ all of whose rows satisfy the requirements of any of the 
above three definitions, hut not those of the other two. 

Thus we obtain three pairwise distinet (and very broad) classes of 
Hadamard matrices of order L = 2™, (L > 16), which we call respec- 
tively: 

(A) the class of 2”^ x 2”^ Hadamard matrices with splitting rows" . 

(B) the class of 2™ x 2”^ Hadamard matrices with QMF- splitting 
rows” . 

(C) the class of 2”^ x 2”^ Hadamard matrices with ^Fybrid splitting 
rows” . 

Our Theorem 16, stated below, says that the intersection of any two 
of the above three classes of Hadamard matrices is contained in the 
third class. (This will lead to the notion and complete identification 
of all general PONS matriees”). We note that none of the classical 
Walsh-Hadamard matrices [1] lies in any of these three classes. 

Theorem 16 (Uniqueness and structure theorem) for General 
PONS Matrices Suppose that all the rows of some 2^ x 2”^ Hadamard 
matrix P have any two of the above three properties (A), (B), (C). 

Then all the rows of P also have the third property, that is, all three of 
(A), (B) and ( C) are satisfied by all the rows. In that ease the rows of P 
eonstitute some permutation of the rows of the “original PONS matrix” 
Pl, with L = 2^, after some of these rows have possibly been multiplied 
by -1. 

Remark 17 The eonverse of Theorem 16 is obvious, in view of the prop- 
erties of Shapiro transforms of ±1 sequenees. 




PONS, Reed-Muller Codes, and Group Algebras 



163 



The proof of Theorem 16 is non-trivial. It is done by induction, and 
it uses (as a lemma) Property 11 of the “original PONS matria?’’ Pp and 
the fact that each row-polynomial Ar{z) has exactly two duals At{z) 
which are of the form dnAr{—z) or dnA*{—z). This lemma, in turn, rests 
on another lemma which involves the exact computation of the greatest 
common divisor between any two row-polynomials of Pp- 

Definition 18 A general PONS matrix of order 2™ is any 2™ x 2™ 
Pladamard matrix satisfying the conditions of Theorem 16. 

Corollary 19 All the rows of any 2™ x 2™- general PONS matrix have 
all those properties of rows of the original PONS matrix Pp which are 
invariant by permutations of rows and sign changes. 

At this stage we mention the following result which also illustrates 
some structural difference between the PONS matrices and its general- 
izations. 

Proposition 20 The largest length of runs of equal consecutive terms 
in any PONS-matrix row is 4- The largest length of runs of equal con- 
secutive terms in any row of a “splitting Pladamard matrix” is 6. 

2.5 Symmetric PONS matrices 

If the “concatenation rule” (4) of section 2.2 is replaced by the new 
concatenation rule 




with the same starting matrix 




then the result is a sequence of 2™ x 2"^ symmetric matrices which are, 
by construction and in view of the above Theorem 16, PONS matrices. 
A consequence of this extremely unexpected result is that, by standard 
operations on rows and suitable choice of the parameters aj of section 
2.2, we obtain on the order of 2^ symmetric 2"^ x 2"^ PONS matrices. 

2.6 Correlation results 

As one would expect in view of the crest factor and energy spreading 
results referred to previously, PONS sequences have interesting correla- 
tion properties. 
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Theorem 21 Let (oq, oi, . . . az,_i) be any row of any L x L PONS ma- 
trix (L = Let 

L-l-j 

Cj ^ ^ 0,kOtk+j (j 1, 2, ... T 1) 

k=0 

denote the j-th out-of-phase aperiodie autoeorrelation of that PONS row. 
Then we have the “maximal” estimate 



max \cA < K ■ 

where K is an absolute eonstant. (The exponent 0.73... arises from the 
eomputation of the speetral radius of some PONS-related matrix.) 



This is indeed a difficult result, and the exponent 0.73... can be proved 
to be optimal. The proof involves, in particular, heavy inequalities on 
norms of matrix products. Also the proof becomes somewhat less tech- 
nical if, instead of the optimal exponent 0.73..., we only with to obtain 
the (slightly less good) exponent 3/4. 

We also point out that a statement similar to our Theorem 21 was 
published in 1997 by another researcher [22], but with a grossly mistaken 
“proof” (based on the statement that the spectral radius of any matrix 
product is independent of the order, which is trivially false if there are 
more than 2 terms). 

Theorem 22 Let (ao, oi, . . .ai-i) be any row of any L x L PONS ma- 
trix /L = 2™ > 4/. Let 



Aj 'y ^ ^k^k+j {j 1, 2, ... T 1) 

k (modulo L ) 



denote the j-th out-of-phase periodie autoeorrelation of that PONS row. 
Then 



7i 



±4c' ifLl^<j<Ll2 
0 otherwise 



where the c'- denote the aperiodie autoeorrelations of some PONS row of 
length LjA. 



While the proof of this Theorem 22 is much easier than that of The- 
orem 21, the two theorems put together imply the estimate 

max | 7 , | < A' • 

for the periodie autocorrelations, with the constant K' = K ■ 
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Figure 1. Average auto- and cross-correlation magnitudes for PONS and Walsh 

Theorem 23 Both for the periodic autocorrelations 'jj and the aperiodic 
autocorrelations Cj of any PONS row of length L, we have 

L-l L-l 

= qL‘^ + 0{L) and = -L‘^ + 0{L). 

i=i i=i 

We mention that similar expressions for cross-correlations (both peri- 
odic and aperiodic) of any two distinct rows of the same PONS matrix 
oscillate between -|- O (L) and -|- O (L). 

As for “maximal theorems” for cross-correlations (that is, results anal- 
ogous to the above Theorem 21 and its corollary for periodic autocorre- 
lations), further research has to be done. Figure 1 compares the average 
magnitudes of all periodic autocorrelations and cross-correlations of the 
Walsh-Hadamard and PONS matrices. Note that, as implied by Theo- 
rem 22, for all PONS matrices all periodic autocorrelations from —1/4 
to -1-1/4 of the length equal 0, except (of course) for the central one. 

3. Shapiro sequences, Reed-Muller codes, and 
functional equations 

We begin our journey into the more algebraic aspects of PONS se- 
quences by discussing their surprising relation to Reed-Muller codes. 
Let be the set of binary 2^-tuples, m > 1. 

For each n, 1 < n < 2™ — 1, and each j, 1 < j < m, let dj^n be the 
coefficient of in the binary expansion of n and define (5o,n to be 
1, 0 < n < 2"^ — 1, so that 



n = ^2^ 
i=i 
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Define vectors gj by 

9j ~ 9ji^) ~ { ^j,o ^j,i ^j,2 ■ ■ ■ )) 

90 = 9o{m) = (111... 1). 

and the matrix Gm by 

Gjn = { 9 O 7 91} ■ ■ ■ 7 9m} 

Example: m = 3 



n 


0 


1 


2 


3 


4 5 


6 


7 


90 


1 


1 


1 


1 


1 1 


1 


1 


91 


0 


1 


0 


1 


0 1 


0 


1 


92 


0 


0 


1 


1 


0 0 


1 


1 


93 


0 


0 


0 


0 


1 1 


1 


1 


: (1 


11 


11 


11 


1), 


(01 


OK 


3101 


(0 


lOl 


10 


01 


1), 


0 

0 


0 

0 


fill 



The Qm are discretized versions of the Rademacher functions. 

Claim. The elements of Gm are linearly independent. 

The Reed-Muller code of rank m and order 0 is 

RM{0,m) = {(00 ... 0), (11 ... 1)}, 

where each vector (codeword) has 2™ entries. RM(l,m) is the subgroup 
of generated by the codewords in Gm, he., the vector space over Z 2 
spanned by these codewords. RM(l,m) contains codewords. 
Define multiplication • on Z2 by 

{xqxi . . . X2m-i) ■ {yovi ... y2--i) = {xovoxiyi . . . X2m-iy2--i)- 

Augment Gm with all products gi ■ gj, 1 < i < j < m, to form Gm . 
Example: m = 3 



n 


0 


1 


2 


3 


4 


5 


6 


7 


90 


1 


1 


1 


1 


1 


1 


1 


1 


91 


0 


1 


0 


1 


0 


1 


0 


1 


92 


0 


0 


1 


1 


0 


0 


1 


1 


93 


0 


0 


0 


0 


1 


1 


1 


1 


9i ■ 92 


0 


0 


0 


1 


0 


0 


0 


1 


9i • 93 


0 


0 


0 


0 


0 


1 


0 


1 


92 ■ 93 


0 


0 


0 


0 


0 


0 


1 


1 
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= GsU{( 00010001), (00000101), (00000011)}. 

Claim. The 1 + m + (™) elements of are linearly independent. 
RM{2,m) is the subgroup of generated by the codewords in G^m ■ 
RM{2,m) contains 2 ) codewords. 

Augmenting Gm with all products of the form gi ■ gj ■ gk, 1 < i < j < 

(3) 

k < m, and continuing as above we get G\n , RM{3,m), etc. 

Theorem. RM{k, m) for m>l, 0</c<misa subgroup of con- 
sisting of 2-^ codewords, where N = (T) • minimum Hamming 

weight (he., number of ones) of the nonzero codewords in RM{k,m) is 

Proof. Exercise, or see Handbook of Coding Theory, V. Pless and 
W.C. Huffman, Editors, Vol. 1, pp. 122-126. 

Let’s examine a particular element Sm £ RM{2,m) given by 



m— 1 

Sm = ^ 9j ■ 9j+l = {so Si ... S2™-l). 



Example, m = 3 



n 


0 


1 


2 


3 


4 


5 


6 


7 


91 


0 


1 


0 


1 


0 


1 


0 


1 


92 


0 


0 


1 


1 


0 


0 


1 


1 


93 


0 


0 


0 


0 


1 


1 


1 


1 


S3 


0 


0 


0 


1 


0 


0 


1 


0 



Let 4>{n) be the number of times that the block H = [1 1] occurs in the 
binary expansion of n, 0 < n < 2™' — 1. 

Claim. 



Sn 



0 if (p{n) is even 

1 if (^{n) is odd. 



Let Qm = {70) 7 i) 72, • • • , 72 ’"-i| be the subgroup of RM{1, m) generated 
by 9ii 92: • • • : 9m- 
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Example, m = 3 





n 


0 


1 


2 


3 


4 


5 


6 


7 










91 








0 


1 


0 


1 


0 


1 


0 


1 










92 








0 


0 


1 


1 


0 


0 


1 


1 










93 








0 


0 


0 


0 


1 


1 


1 


1 




70 = 


= 0 


91 


+ 0 


■92 


+ 0 


93 


0 


0 


0 


0 


0 


0 


0 


0 




71 = 


= 1 


91 


+ 0 


■92 


+ 0 


93 


0 


1 


0 


1 


0 


1 


0 


1 




72 = 


= 0 


91 


+ 1 


■92 


+ 0 


93 


0 


0 


1 


1 


0 


0 


1 


1 




73 = 


= 1 


91 


+ 1 


■92 


+ 0 


93 


0 


1 


1 


0 


0 


1 


1 


0 




74 = 


= 0 


91 


+ 0 


■92 


+ 1 


93 


0 


0 


0 


0 


1 


1 


1 


1 




75 = 


= 1 


91 


+ 0 


■92 


+ 1 


93 


0 


0 


1 


1 


0 


0 


1 


1 




76 = 


= 0 


91 


+ 1 


■92 


+ 1 


93 


0 


0 


1 


1 


1 


1 


0 


0 




77 = 


= 1 


91 


+ 1 


■92 


+ 1 


93 


0 


1 


1 


0 


1 


0 


0 


1 



We now switch gears slightly, by rewriting all codewords in RM{k,m) 
by mapping 0 ^ 1, 1 ^ — 1. Since gi,g2, ■ ■ ■ ,9m are discretized versions 
of the Rademacher functions, are discretized versions 

of the Walsh functions. That is, Qm is the 2”^ x 2”^ Sylvester Hadamard 
matrix, which we relabel ffm. 

Example. 



1111 
1-1 1-1 
1 1 - 1-1 
1 - 1-1 1 
^ 3-1 1 1 1 

1 1 - 1-1 
1 1 - 1-1 
1 - 1-1 1 



1 1 

1 -1 

1 1 

1 -1 

-1 -1 

1 1 

-1 -1 

-1 1 



1 1 

1 -1 

-1 -1 

-1 1 

-1 -1 

-1 -1 

1 1 

1 -1 



Now Sn = 

Since we have 



S2n — Sn; 



S2n+1 — 




if n is even 
if n is odd 



the binary expansion of 2n is the binary expansion of n shifted one 
slot to the left with a 0 added on the right and the binary expansion of 
2n + 1 is the binary expansion of n shifted one slot to the left with a 1 
added on the right. If n is even this does not change 0(n). If n is odd 
(i.e., n ends in 1) then 4>{2n + 1) = 4>{n) + 1. 
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OO 

9{z) =^SnZ"^. 
n=0 

It can be shown that g{z) satisfies the functional equation (FE) (Brill- 
hart and Carlitz) 

g{z) = g{z‘^) + zg{-z^). 

Iterate this FE: 

g{z^) = g{z‘^) + z‘^g{-z'^) 
g{-z^) = g{z‘^) - z^g{-z'^), so 

g{z) = {l + z)g{z^) + z\l-z)g{-z^). 

Repeat: 

g{z^)=g{z^)+Z^g{-Z^) 
gi-z'^) = g{z^) - z^g{-z^), so 

g{z) = {l + z + z^ - z^)g(z^) + z^(l + z - z^ + z^)g(-z^). 

Continuing we see that, beginning with 

g(z) = A{z)g{z^^) + z^^'" B{z)g{-z^^) 



and applying 




, 2”i+ls 2 ^ / 2™+l\ 

= g{z ) + z g{-z ) 

2 ™+l\ 2 *" / 2 ”"+l\ 

= g{z )-z g{-z ) 



we get at the next step 



giz)=[A{z) + z‘^^-'B{z)] g{z‘^’^^') 

+ z^’^[A{z)-z^"^-"B{z)]g{-z^"^^') . 

Renaming the initial A{z) and B{z) to ^0(2^) and Qo{z), respectively, 
and naming the (polynomial) coefficients of g{z‘^'^) and g{—z^'^) Pm-i{z) 
and Qm-i{z), respectively, m > 1 , the above yields 



^0(2) = Qo{z) = 1 
Pmiz) = Pm—l{z) + 2 Qm—l{z) 

Qm{z) = Pm-l{z) - ^Qm-l{z) 
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Thus, the {Pm{z)}^^Q and {Qm(-2)}m=o precisely the Shapiro Poly- 
nomials! Pm{z) and Qm{z) are each polynomials of degree 2”^ — 1 with 
coefficients ±1. For each m, the first 2™ coefficients of g{z) are exactly 
the coefficients of Pm{z). Therefore, for each m, the 2™-truncation 

( So 'Sl • • • S2"*-l ) 

of the Shapiro sequence {sj}^q is an element of RM{2,m). 

Why might that be important? 

Recall the fundamental property of the Shapiro polynomials, namely 
that for each m Pm and Qm are complementary: 

\Pm{z)\'^ + \Qm{z)f = 2™-+^ for all l^l = 1. 

Consequently Pm and Qm each have crest factor (the ratio of the sup 
norm to the norm on the unit circle) bounded by \/2 independent 
of m. be.. Pm and Qm are energy spreading. So the coefficients of Pm 
are an energy spreading second order Reed-Muller codeword. Related 
results may be found in [11, 24]. 

Also, letting /ij, 0 < j < 2™ — 1, denote the rows of H^, the matrix 
whose rows are Sm ■ hj , is a PONS matrix. Its 2”^ rows can be split into 
2 m-i pg^jj-g q£ complementary rows, with each row having crest factor 
(bounded by) \/2. 

Since each hj G RM{l,m) and Sm G RM{2, m), the (rows of the) PONS 
matrix is a coset of the subgroup RM{l,m) of RM{2,m). 

Thus we have constructed 2™ (really 2™+^ by considering — H^) energy 
spreading second order Reed-Muller codewords. 

Note that blocks other than B = [11] appear in connection with 
higher-order Reed-Muller codes. For example, the block [1 1 1] yields 
codewords in RM(3,m). The generating functions of these blocks sat- 
isfy similar (although more complicated) FE’s. An open question is 
whether these FE’s yield corresponding crest factor bounds for subsets 
of RM{k,m), k > 3, resulting in higher-order energy spreading Reed- 
Muller codes. 

Blocks and FE’s 

Let B = [/?! /?2 . . . /3r]) Pj = 0 or 1, /?i = 1 be a binary biock and 
N = N{B) = Pr + 2/?r-i -I- ... -I- 2^~^Pi be the integer whose binary 
expansion is B. Let be the number of occurrences of B in the 

binary expansion of n and let fs{z) be the generating function of 'Lb, 

OO 

fB{z) = '^^B{n)z'^ . 

n=0 
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n 

fsiz) = (1 + z)fB{z^) + ^ 

1 — 



Now consider the parity sequence of Ssin) = (— l)’^s(’^), and its 

generating function qb{z) = Yl^=o^B{n)z^. For the general case it will 
again be useful to split qb into its even and odd parts, 



Eb{z) = Y,^B{2n)z‘^^ 

n=0 

oo 

Ob(^) = J^5s(2n + l)z2-+i 

n=0 



Previous example: B = [11], 8B{n) is the Shapiro sequence, qb^z) 

satisfies the FE qb{z) = gB{z^) + zgB{—z‘^). 

Example: il = [l]. 

As before, ^B(2n) = ^B{n) and ^B{2n+1) = 'I'b(?^) + 1 so that (writing 
5n for 5B{n) to ease notation) 62 n = 8 n, (^ 2 n+i = — 8 n- Hence Eb{z) = 
gB{z‘^), Ob{z) = -zgBiz"^), and we have the FE gB^z) = (1 - z)gB{z^)- 
Iterating, gB{z) = (1 — z){l — z‘^){l — z‘^) . . . and is the Thue-Morse 
sequence [1 — 1 — 11 — 111 — 1 ...]. We drop the subscript B from now 
on. 

Example: = 0. 

^'(2n+l) = ^'(n), so 52n+i = so 0{z) = zg{z^). Since g{z)-g{-z) = 
20(z) we have the FE g{z) = g{—z) + 2zg{z'^). 

Example: (5r = 1- 

As above, now g{z) = —g{—z) + 2g{z‘^). 

Example (a typical case? ): H = [110010],r = 6. 

'h(2n + 1) = 'h(n). 'h(2n) = 'h(n) unless the binary expansion of n ends 
in [110 01], i.e., unless n = A(mod 2^), where A' = 2^ + 2^ + 2° = 25, 
in which case \h(2n) = il'(n) + 1. So 



<^2n+l — 8n, ^271 



j — 6 n if n = 25(mod 32) 
[ Sn otherwise 



So 0{z) = zg{z‘^). 
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OO OO 

E{z)=Y,S2nZ^" = Y.5,,z^^-2 Y. 

n=0 n=0 n=25(mod32) 

OO 

= g{z^) - 2 <532, +25^^^"^'+“ = g{z^) - 2z^^F{z) 

J=0 

where F{z) = ^f=o S32j+25Z^"^^ ■ 

But <^32, +25 = <52(16, + 12)+1 = <^ 16,+12 = <52(8,+6) = <Js,'+6 = <52(4,+3) = 
<54,+3 = <52(2 ,+i)+i = <52,+! = <5,') where we have used the fact that 
neither 8j + 6 nor 4j + 3 can be congruent to 25(mod32). So F{z) = 
Yl^o = g{z^^), and we have the FE 

g{z) = (1 + z)g{z‘^) - 2z^^g{z^^). 

How typical is this example? Do we always get Full Reduction (FR) of 
the index of <5? 

Consider the general case: 

B=[pip2 --.Pr] 

N = Pj. A- 2/3j._i + . . . + 2^ ^Pi 

K = Pr—l + 2Pj.—2 + . . . + 2 ^ Pi 



Case I: Pr = 0. As above, 

<52n+l — <5n; ^2n — 



— <5„ if n = A(mod2^ 
5n otherwise 



0{z) = zg{z^), F{z) = g{z^) - 2z^^ Y ^ 2 -p+Kz'^''^ ■ 

,=o 



To get FR the index 1(1) = = 2’’ + K must reduce to j by 

repeated applications of the mapping g{n): 

g{2n + l) = n, g{2n) = n unless n = AT(mod 2^“^). 



Let {/(I), 7(2), . . .} be the succession of indices that we get by repeating 
(assuming it works), and let I denote one of these indices. Whether 
I = 2n + 1 or 7 = 2n, reduction to n occurs by dropping the last 
binary digit on the right of 7 and shifting what’s left 1 slot to the right. 
For reduction to fail at the first step, 7(1) must be of the form 2n 
where n = 7L(mod2^“^), or n = 2^“^m + K for some integer m, or 
2n = 2^m + 2K. 

The binary expansion (BE) of K is {P1P2 ••• Pr-i) so that of 2K is 
{Pi P2 ■ ■ ■ Pr-l 0 ). 
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So for the first reduction 7(1) ^ 1(2) to fail the BE of 1(1) must end in 
(/3i j32 ■ ■ ■ fir-i 0). This is possible (i.e., there are integers j which make 
it possible) iff the BE of 1(1) ends in (j 32 Ps ■ ■ ■ /S^-iO), or (since the BE 
of 1(1) ends in that of K) 



(Pi P2 ■ ■ ■ Pr-l) — (P2 Ps ■ ■ ■ Pr-lO) ■ 

Assuming this equation does not hold we get 1 ( 2 ) whose BE ends in 
(P1P2 ■■■ Pr-2)- As above, 1 ( 2 ) 1 ( 3 ) fails iff the BE of 1 ( 2 ) ends 

in (Pi P2 ■ ■ ■ Pr-i 0) which is possible (again, there are integers j which 
make it possible) iff 1 ( 2 ) ends in (P^ P 4 . . . Pr-i 0), or 

(Pi P2 ■ ■ ■ Pr-2) = (Ps Pa ■■ ■ Pr-i 0 ) . 

Call the block B = [Pi P 2 ... Pr] nonrepeatable if 

[Pi P2 ■ ■ ■ Pu\ 7 ^ [Pr—(v—l) Pr—(v— 2 ) ■ ■ ■ Pr\ 



for each v, 1 < u < r — 1 . 

Theorem. ER works iff B is nonrepeatable. When ER works we get 
the EE g(z) = (1 + z)g(z‘^) — 2z‘^^ g(z‘^"^). 

Case II: Pr = 1. The above argument works when B is nonrepeatable 
up to the last step, yielding: 

Theorem. If [Pi P 2 ... Pu] 7 ^ [Pr-(u-i) Pr-{u- 2 ) ■ ■ ■ Pr] for each v ,2 < 
1 / < r — 1, and Pi = Pr = 1, then reduction works up until the final step 
and we get the EE 

g(z) = (l + z)g(z^)- 2 z^^+^-‘^^~" 

Other cases are not so neat. 

Example. R = [1 1 0 1 1 1] . 

The EE is 

g(z) = (1 + z)g(P^) - 2z'^g(z^^) + 2z'^g(z^^) + 2z‘^^g(z^^) 

Example. R = [1 0 1 1 0 1] . 

The EE is 



g(z^^-^)-g(z^^) 



g(z) = (1 + z)g(z^) - 2 z^[g(z^) - (1 + z^)g(z^^)] - 2 z^^[g(z^^) - g(z^% 
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The general “1-1” case, /?i = /^^ = 1- 



^2n — ^ri7 ^2n+l — 



—5n if n = iir(mod 2” 
^ 5n otherwise 

K = (3r-i + 2/3^_2 2” ^/?i , 

E{z) = g{z ^) , 



0{z) = zgiz"^) - 2 ^ - 2Gb(z) 

n=K 

(mod 2^“^) 
oo 

where Gb{z) = E ^2^-'^j+KZ^ i+2if+l^ 

1=0 



Basic idea: Reduce the subscript of 5 as much as possible, express 

Gb{z) in terms of Gb{z^^) for some p > 0, replace Gb{z^^) by using 
-2Gb{z‘^") = 0{z^^)-z^^g{z^^^^)=g{z^^)-g{z^^^')-z^^g{z^^^^) and 
then repeat to get the desired expression for 0(z) = g{z) — giz"^)- 
The result for the “fully repeatable” case, (3j = l,l<j<r, is: 



g(z) = (1 - z)g{z^) + 2z[g{z^) + z^g{z^) + z^g{z^^) 

+ . . . + z^^~"-^g{z^^~') + z 2 ”-'- 2 ^(^ 2 ^)]. 

4. Group Algebras 

Consider a finite abelian group A with group composition written as 
multiplication. The group algebra CA is the vector space of formal sums 

f='^f{u)u, f{u)GC, 

ueA 

with algebra multiplication defined by 

f9 = '^('^f{u)g{u~^v)\v, f,geCA. 

v&A VnsA / 

Identifying u G A with the formal sum u, we can view yl as a subset 
of CA. CA is a commutative algebra with identity. In this section we 
consider group algebras over direct products of the cyclic group of order 

2 . 

Denote the direct product of N copies of the cyclic group of order 2 
by 

C2(xo, . . .,XN-l) 
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A2{xo,...,xn-i)- 



C 2 (xo, ■ ■ ■ ,xn-i) is the set of monomials 

■■■ x^j^d, j „ = 0 , 1 , 0 < n < N, 
with multiplication defined by 

Xn = 1, 0 < n < N, (8) 

XmXn = XnXm, 0 <m, Tl < N. (9) 

We call the factors Xn, for 0 < n <N, the generators of C2{xq, . . . ,xn-i)- 
For example, 

C 2 {xo) = {l,xo} 



and 



C2{xo,Xi) = {1 ,Xo,Xi,XqXi}. 
7^2 (xo, . . . , X]\f-i) is the algebra of polynomials 



/ = X] ■ ■ ■ • • • ^^N-l > 

io=o jN-i=o 



with multiplication defined by (8) and (9). For example in 7l2(a^o) the 
algebra multiplication is given by 



(ao + aiXo){bo + bixo) = cq + ciXq, 



where 



Co 




ao 


ai 




0- 

0 

1 


Cl 




ai 


ao 




[ 61 J 



and in A 2 {xo,xi) the algebra multiplication is given by 

(ao + aixo + 02X1 + a 3 XoXi){bo + biXo + 62X1 + 63X9X1) = 
Co + CiXo + C2X1 + C3X0X1, 

where 



Co 




ao ai 02 03 




1 

0 

1 


Cl 




Oi Oo 03 02 




61 


C2 




02 03 Oo Oi 




62 


C3 




03 02 Oi Oo 




CO 

.0 



In general multiplication in t12(xo, • • • ,xat_i) can be described by the 
action of a 2^ x 2^ block circulant matrix having 2x2 circulant blocks. 
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The monomials 

Xq ■■■ , jn = 0, 1, 0 < n < IV, 

define a basis, the canonical basis of the vector space A 2 {xq, . . . , xn-i)- 
The basis is ordered by the lexicographic ordering on the exponents. For 
example, as listed, 

l,xo 

is the canonical basis of A 2 {xq) and 

l,Xo,Xi,XoXi 

is the canonical basis of A 2 {xq,xi). 

A nonzero element r G CA is called a character of A if r(l) = 1 and 

VT = V ^ a. 

Denote the collection of characters of A hy A*. The characters of 

C2(xo, . . . ,XAT_i) 

are the set of products in A 2 (xq, • • • , xn-i) 

(1 + eoxo) • • • (1 + eAT-iXAT-i), = ±1, 0 < n < iV. 

The characters of C 2 (xo) are 



1 + xo, 1 - Xq. 

The characters of C'2(xq, • • • ,xtv-i) form a basis of the vector space 
A2(xq, • . . , xtv-i), called the character basis. The characters are ordered 
such that the matrix H 2 N of the character basis relative to the canonical 
basis is the A^-fold tensor product 



H2N = H2 ( 8 > • • • < 8 > H2, 



where 



H 2 



1 1 
1 -1 



is the 2x2 Fourier transform matrix. The matrix of the character basis 
of C 2 (xo) 



1 + Xo, 1 - Xo 



is H 2 and the matrix of the character basis of C' 2 (xo,xi) 



(1 + xo)(l + xi), (1 - xo)(l + xi), (1 + xo)(l - xi), (1 - xo)(l - Xi) 
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is 



if4 = 



1111 
1-1 1-1 
1 1 - 1-1 
1 - 1-1 1 



The set of group-translates of the characters of C' 2 (xo) 

{1 ± xo)x’l^ ■ ■ ■ , kn = 0,l, 0<n<N, 

forms a basis of ^ 2 ( 0 : 0 , • • • ,xat_i), called the translate- character basis. 
The translate-character basis is ordered by first forming the set of pairs 



(1 {1 - xo)x'l^ ■ ■ ■ (10) 



and then lexicographically ordering the exponents in the set of pairs. The 
matrix of the translate-character basis relative to the canonical basis is 
the A^-fold matrix direct sum 



The translate-character basis of A 2 {xo,xi) is 

l + xo, l-xo, {l + xo)xi, {l-xo)xi. 

The translate-character basis will be especially important in the de- 
velopment of PONS. The main reason is contained in the following dis- 
cussion. 

Consider / G A 2 {xo , . . . , xat_i) such that 

/(io, • • • , jAf-i) = ±1, in = 0,1, 0 < n < iV. 

The coefficients of the expansion of / over the translate-character basis 
are ±1 or 0 and for each exponent set 

/ci, . . . , kn = ±1, 0<n<N, 

exactly one element in the pair (10) has nonzero coefficient. For example 
the element in A 2 {xo,x\) 

/ = /O + flXQ + /2X1 -h hXQXl 



can be written as 

/ = /o(l + fofixo) + / 2(1 + f 2 hxo)xi. 
If /o, fi, / 2 , h = ±1, then /o/i, / 2/0 = ±1. 
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5. Reformulation of classical PONS 

We reformulate the classical PONS construction procedure as de- 
scribed in Section 2.2 using group algebra operations, and we distinguish 
these orthonormal bases within the group algebra framework. For the 
purpose of this work we will modify P 2 N, the 2 ^ x 2 ^ PONS matrix, by 
row permutation and row multiplication by — 1 . 

Specifically we explore certain relationships between PONS matrices 
and character basis matrices. In particular, we show that a PONS ma- 
trix is completely determined by its 0 -th row and the equivalent size 
character basis matrix. For a classical PONS matrix, the 0-th row can 
be constructed arithmetically and provides an example of a splitting se- 
quence. The concept of a splitting sequence will be developed in a group 
algebra framework in the next section. 

By the original PONS construction as given in Section 2.2 the 4x4 
PONS matrix is 

■ 1 1 1 -1 ■ 

11-11 
1-1 1 1 ■ 

1 -1 -1 -1 

Interchanging the 1st and 2nd row we have the matrix 



Pa 



111-1 
1-111 
11-11 
1 -1 -1 -1 



The component-wise product of any two rows of P 4 is a row of the 
character basis matrix In fact 



Pa — HaDai 



where D 4 is the diagonal matrix formed by the 0-th row of P 4 . 

In general, by row permutation the classical 2^ x 2^ PONS matrix 
can be transformed into the matrix 



P2N = H2N D2N , 

where D 2 N is the diagonal matrix formed by the 0 -th row of the classical 
PONS matrix. 

The 0-th row of the original 4x4 PONS matrix can be defined arith- 
metically. From the binary representation of the integers 0 < n < 4 



00 , 01 , 10 , 11 
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we see that —1 is placed in the position having two I’s. Once D 2 N is 
constructed independently of the original construction, we can define the 
2 ^ X 2^ PONS matrix as 



P2N = H2N D2N ■ 

In the following sections we will place this result in the group algebra 
setting. 

Several other definitions are possible. For example 

D2N H2N D2N 

can be viewed as a symmetrical form of the 2^ x 2^ PONS matrix. 



6. Group Algebra of Classical PONS 

For an integer N > 0, we will use group algebra operations to define a 
PONS sequence of size 2^ . The first 2”, n < N, terms of this sequence 
are the same as the elements in the 0-th row of the classical 2" x 2" 
PONS matrix. PONS sequences will be identified with elements in the 
group algebra having special properties. Below we will extend these 
results to give a group algebra characterization of general binary splitting 
sequences. 

Denote the elements of the canonical basis of A 2 {xq, . . . , Xn-i) by vq, 
ui, . . ., V 2 ^-i- For a G C 



a — (oo, ai, . . . , a2"-i) 

identify a with the element a in A 2 {xq, . . . , Xn-i) 



2"-l 

a — > 

r=0 



If n = 4, 
Set 
and 



a = aol + aixo + a2X\ + 03X9X1. 
(X2 = I + Xq, 0-2 = I — Xq 



«4 = «2 + «2*1) a\ = a2 — OI 2 X 1 . 
corresponds to the sequence 



I I I -I 



which is the 0-th row of P4. 
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Set 

CX2^ = Ck2"-1 + O^n-ia^n— 1 

and 

0 ^ 2 ^ — Oi’^TL — X OLc2n — \XYi . 

The sequence corresponding to 02 " is the 0-th row of P 2 " • 

We will study the group algebra properties of the elements « 2 " and 
a^n. The key to understanding the reason for expansions over the 
translate-character basis is contained in the character product formula 



ai = 2(l + xo), K)2 = 2(1-xo) (11) 



a2tt2 = 0. 



(12) 



Implications of these formulas will be seen throughout this work. 
Since 

= CX2 -j- 2 cK2Q^2^1 T (^ 2 ) 5 

we have by (9) and (12) 



a^ = 2(l+xo) + 2(l-xo)=4. (13) 



In the same way 

(«:)" = 4 . (14) 

Since 

a4a2 = — (a^)^; 

by (9) 

a4a2 = 2(1 -I- xo) - 2(1 - xq) = 4xq. (15) 

We can write the important factorization 



04 = a4Xo- (16) 

By (16) 

as = «4 + 01 * 4 X 2 = 04(1 -I- X 0 X 2 ) (17) 

and 

ag = 04(1 - X 0 X 2 ). (18) 

Since 



(1 -h XQX 2 f = 2(1 -h X 0 X 2 ), (1 - XqX 2 )^ = 2(1 - X 0 X 2 ), 



and 



(1 -h XqX2)(1 - X 0 X 2 ) = 0, 
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we have 

aj = 8{1 + X 0 X 2 ), {alf = 8{1 - X 0 X 2 ) , 

and 

agag = 0 . 

«4 and «8 have very different group algebra properties reflecting the 
expansion of 04 

«4 = 1 + + (1 - xo)xi 

in terms of the conjugate characters 1 + xq and 1 — xq and the expansion 
of CTg 

ag = 04(1 + X0X2) 

in which the character 1 + xqX 2 is a factor. This result is general for «2" 
depending on whether n is even or odd. 

Arguing as above 

ai6 = «8 + cCgXg = «4 [(1 + X0X2) + (1 - X0X2)X3] , 

a*iQ = ag- «gX3 = «4 [(1 + X0X2) - (1 - XqX2)x 3] 
from which we have 

«16 = = 16 , 
aieaie = 16 xqX 2 , 

“I6 = Oil 6 XoX 2 - 

The same arguments show 

<^32 = «16 + CH1QX4 = ai6(l + X0X2X4) 

®32 = « 16(1 — X0X2X4). 

In general if n is odd 



and 



Q!2" — (U2"-l (1 + XqX2 • • • Xn-l), 
0^2^ — 0^2^^—! (1 XqX2 * * * Xfi — ) 



ai„ = 2^(1 + X 0 X 2 • • • x„_i), (a 2 nf = 2”(1 - xqX 2 • • • x^-i) 

® 2 "® 2 " ~ 6 . 



If n is even 



«2" = «2"-2 [(1 + X 0 X 2 • • • Xn- 2 ) + (1 — a:oX2 • • • Xn-2)Xn-l] 
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Oi2n — CX2n—2 [(1 “t" XqX2 * * * ^n— 2 ) (1 ^0^2 * * * ^n— 2)^n— l] 

and 

0^2" ~ (®2"j ~ ^ ) 

a 2 ^a 2 n = 2 ^XqX 2 ■ ■ ■ Xn-2 
= U2^XqX 2 • • • Xn-2- 

An important implication of these formulas is that if n is odd, a 2 ^ is 
not invertible in the group algebra while if n is even, a 2 -n- is invertible 
with inverse 2 “"'a 2 ". 

7. Group Algebra Convolution 

In this section we relate convolution in vl 2 (xo, • • • , a^TV-i) by the PONS 
element U 2 n with the PONS matrix P 2 N . 

Consider 

a G A2{xq, . . .,XN-i)- 

The mapping C 2 N{oi) : A 2 {xq, . . . ,rE7v-i) — > ^ 2 (a^ 0 ) • • • ^xn-i) defined 
by 

C2N{a)P = af3, j] G A2 {xo, ■ - - ,XN-i) 

is a linear mapping of A 2 (xq, • • • ,xn-i) called convolution by a. The 
matrix of C 2 N{a) relative to the canonical basis is 

C2N(a) = 



For 



«4 = 1 + Xo + Xi — XqXi 



the matrix C 4 (a 4 ) can be formed from the products 



Xoa4 = 1 + Xo — Xi + XqXi 

Xia4 = 1 — Xo + Xi + XqXi 

XoXiQ; 4 = — 1 + Xo + Xi + XqXi- 

1 -1 ■ 

-1 1 
1 1 ■ 

1 1 

Note that, as described in Section 2.5, the above is the 4x4 symmetric 
PONS matrix. 



C4(a4) — 



1 1 

1 1 

1 -1 
-1 1 
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We have defined the 4x4 PONS matrix as 



Pa = 



111-1 
1-111 
11-11 
1 -1 -1 -1 



Convolution by «4 and P 4 are related by 



C4(a4) = i?4-P(4, 2)^4, 



(19) 



where H 4 is the diagonal matrix formed by the 0-th row of P 4 and P(4, 2) 
is the 4x4 stride by 2 permutation matrix 



P(4,2) 



10 0 0 
0 0 10 
0 10 0 
0 0 0 1 



Since P 4 = H 4 ^Di, 

C4(cr4) = Dj^P{A, 

In group algebra terminology, -P(4, 2) is the matrix relative to the canon- 
ical basis of the automorphism of A 2 {xq,xi) defined by the group auto- 
morphism of C 2 {xq,X\) 



Xq 



xi, Xl 



Xq. 



By (19), up to row permutation and multiplication by —1, convolution 
by the classical PONS element «4 in A 2 {xq, xi) coincides with the action 
of the 4x4 PONS matrix P 4 . This will be the case whenever N is even 
and 2^ is the length of the classical PONS element. The length 16 case 
will be considered below. 

For N odd, since 

ag = 8(1 -I- X 0 X 2 ) 



and as is not an invertible element in A 2 {xq, xi, X 2 ), convolution by as 
in A 2 {xq, Xi, X 2 ) cannot coincide with Pg even after row permutation and 
multiplication by — 1 . 

Set 



J2 



0 1 
1 0 ’ 



J 2 N = In ( 8 > J2- 



Since 



ag = 04(1 + X0X2) 
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Cs(l +XQX2) = 



we have 



By (19) 



Csicts) — 



h Ja 

J4 l4 



h Ja 
Ja h 



{I2 ® €4(04)) . 



Cs(a8) — 



D4 J4D4 
J4D4 D4 



(/2®P(4,2))(/2®P4). (20) 



where 



Ps — HsDg, 



Hs = {H2 ® Ia){I2 ® H4) 



Dq — (D4 (E) I2){l2 ® Da)- 



By (19) 



Pg = {H 2 ® h) {{h ® H4){D4 ® l2){h ® H^^)) {I 2 ® Pa)- 
Direct computation shows 

{h ® H4){D4 ® h){l2 ®Hl^) = (I 2 ® P{^,2)){h © Ja){I2 ® ^(4,2)), 

where © is the matrix direct sum. 

These results show that 



C^{a^)P^ ^ = 



D4 J4D4 
J4D4 D4 



{I4 © Ja){H^^ © l4){h © -P(4,2)). 



Since 



J4D4J4 = D1 = 



we have the main result relating Cs(as) and Fg 



Csiag) — 



1 r D4 + DI D4-DI 



2 {D4 + ZI4) J4 (D4 ~ D4) J4 



(/2©P(4,2))P8. 



A direct computation shows 



C8(as) = DgQg{l2 © P(4, 2))Pg 
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where Dg = H 4 © and 

‘ 1 0 0 0 0 0 0 0 ‘ 

0 1 0 0 0 0 0 0 

0 0 0 0 0 0 1 0 

_ 00000001 
01000000 ■ 

1 0 0 0 0 0 0 0 

0 0 0 0 0 0 0 1 

0 0 0 0 0 0 1 0 

Qs is a singular matrix. In fact if 

w = Pgv, 

then Cs(a 8 )v computes only the components 



Wo, W2, W5, W7. 

The computation of the remaining components is given by C8(ag)v. 
Since 

«16 = «4 [(1 + X0X2) + (1 - XQX2)xg] , 
convolution by a\o in A2{xq, x±, X2, X3)can be written as 

r s _ r C8(l +X0X2) Cs{ 1 -XqX 2 )]^ : , 

Cw(aie) - [ Cs,(l - Cs(l + xoxj) J 

Arguing as before 

Cl6(ai6)= (/4®A)4P(4,2))(l4©P4), 

_ -^8 ^8 _ 



where 





h Y* _ h —Ja 

J4 I4 J ’ "^8 “ [ -J4 /4 • 

P\G = HiqDio, 



Hie = {Hi(^ h){h(^ H4) 



(T >4 © h){h © T »4 © h){h © D4). 



P 16 = {Pi © /4)(/4 © Hi){l2 © T>4 © h){h © Hl^){h © Pi)- 
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By direct computation 

{h <8) H4){l2 (8) A <8) h){h <8) 

= {h < 8 ) P{4, 2)){h © J 4 © l 4 © J 4)(/4 © P{4, 2)). 



Combining the preceding formulas 
Cw{oil6)PiQ^ 



Xs X*s 
X*s Xs 



{h^D4){hej^ehej4){p^^0U){h^p{4,2)). 



Since 

h h h h 

h —h h ~h 

h h —h —h 

—h h h —h 

and 

,hD^J^ = Dl, 

we can derive the main result relating Ci6(ai6) and Pig. 



P4 ^ © /4 = 



Ci%{aiQ) = Yi6(/4 <8> P(4, 2))Pi6, 



where 



^16 = 



1 

2 



ZI4 + ZI4 ZI4 — 0 

0 0 {D 4 + 

Di -Dl D 4 + DI 0 

0 0 (P>4 - Dl)Ji 



0 

{Dl - D^)Ji 
0 

— {D^^ + DDJ 4 



Direct computation shows that 



Die — DiqQiq 
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where Qiq is the permutation matrix 

1 0 0 0 0 0 0 0 

01000000 
00000010 ^ ^ 

0 0 0 0 0 0 0 1 

0 1 0 0 0 0 0 0 

10000000 
^ ^ 00000001 

0 0 0 0 0 0 1 0 

0 0 0 0 1 0 0 0 

00000100 
00100000 ^ ^ 

0 0 0 1 0 0 0 0 

0 0 0 0 0 1 0 0 

00001000 
^ ^ 00010000 

0 0 1 0 0 0 0 0 . 

From these formulas 

Cl6(«16) = DiqQiq{I 4 ^ (g) P(4, 2))Piq. 

The permutation matrix Qiq corresponds to the automorphism of the 
algebra A2{xq,x\,X2,x^) induced by the group automorphism defined 
by 

XQ > Xq 

Xl — > X1X3 

X2 > X3 

X3 > 3 ^ 03 ^ 2 - 

Combined with the automorphism defining (I 4 (g) P(4, 2)) 

Xq > Xl 

Xl > Xq 

X2 > X2 

X3 > X3 

the permutation matrix Qiq{I 4 (g) P(4, 2)) corresponds to the automor- 
phism of the algebra defined by 

Xq > X1X3 

Xl > Xq 

X 2 > X3 

X3 > XqX2. 
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8. Splitting Sequences 

The concept of a splitting sequence was discussed in Sections 2.3 and 
2.4. In this section we will describe the binary splitting sequences using 
group algebra operations. We begin by establishing notation. 

The characters of C 2 {xo) are 

1 + xo, 1 -xq. (21) 

The expressions 

±(l+xo), ±(l-xo) 

will be denoted by A with or without subscripts. Typically 

A = a(l + exo), (22) 

where a = ±1 and e = ±1. a is called the coefficient of A and e is called 
the sign of A. In general 

A1A2 = 0 

if and only if sign{Xi) = —sign{\ 2 )- A is called a directed character of 
C2(xo). 

Identify sequences of length 2^ with linear combinations over the 
canonical basis of ^ 2 (^ 0 , • • • , xtv-i)- A sequence of length 4 

boi 61, 62, 



is identified with the element 



a = 60 + b\XQ + 62X1 + 63X0X1 



in ^ 2 (xo,xi). 

Only sequences of ±1 will be considered. Every sequence of length 
2^ of ±1 uniquely determines a sequence of directed characters 

Ao, Ai, . . . , Atv -1 

and consequently a sequence of signs, the n-th sign equal to the sign 
of A„, and a sequence of coefficients, the n-th coefficient equal to the 
coefficient of A„. If 



a = -(I + xo) + (I - xo)xi, 

then the corresponding sequence of directed characters is 



(I + xo), I-xo 
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the corresponding sequence of signs is 

+ - 

and the corresponding sequence of coefficients is 

- + . 



The splitting condition on sequences of ±1 places conditions on the 
sign patterns of splitting sequences. The following tables describe the 
sign patterns for N = 2, 3 and 4. 



Table 1. Sign patterns for splitting sequences 


N = 2 


- + + - 


N = 3 


- + -+ - + + - H h- H + 


N = 4 


- + -H ^ h -H h + - + - - + H + + - 

H h-H h- +-H + -+ H hH + 



Consider a length 4 splitting sequence 04 and write 

04 = Aq + AiXi, 

where Aq, Ai are directed characters. The length 4 splitting condition 
implies 

Ag + A^ = 1 . 

By the character product formula 



AoAi = 0. (23) 

Condition (23) is also sufficient for splitting, proving the following result. 

Theorem 24 is a splitting sequence of length 4 if md only if 04 has 
the form 

«4 = ±((1 + xo) ± (1 - a;o)a:i) 

or 

«4 = ±((1 - Xq) ± (1 + Xo)xi). 



The splitting sequences of length 4 having sign pattern 

+ - 



are given by 



«4 = ±((1 + xo) ± (1 - a;o)a:i) 
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while those having sign pattern 



- + 



are given by 

04 = ±((1 - Xo) ± (1 + Xo)xi). 

Corollary 25 If 04 is a splitting sequence of length 4, then o| = 4. 

The condition = 4 is also sufficient for a sequence of ±1 of length 
4 to be a splitting sequence. 

Consider a splitting sequence as of length 8 given by the directed 
characters 

Aq, Ai, A 2, A3. (24) 

08 can be written 

Os = O4 + O4X2, 

where 

04 = Aq + AiXi, 04 = A2 + A3X1, 

are splitting sequences of length 4. The length 4 splitting condition 
implies 

04 + (04)^ = 8. 

There are 4 cases to consider. Suppose 04 and o| have sign pattern 

- + . 

The length 8 splitting condition implies that the coefficients of the di- 
rected characters in (24) satisfy 



— aooi — 0203 — 0 . 



Since 
we have 
and 



0404 = 20002(1 - Xo) -I- 20103(1 -I- Xo), 

O4O4 = i 4 xo 



Os = 04(1 ± X0X2). ( 25 ) 

The same argument shows that if 04 and o| have the sign pattern 

H — ) 



then og has the same form. 
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Suppose 04 has the sign pattern 

- + 



and «4 has the sign pattern 

+ - . 

The splitting condition implies 

— aoo.1 + a.20'3 = 0. 



Since 

040:2 = (20003(1 - xo ) + 20402(1 + xo )) xi , 

we have 

q; 4Q;4 = i4xi 

and 

Os = ^ 4(1 i X 1 X 2 ). (26) 

The same result holds if we reverse the patterns of 04 and 

Since (25) and (26) define splitting sequences whenever «4 is a split- 
ting sequence of length 4, we have the following result. 

Theorem 26 as is a splitting sequence of length 8 if and only if as has 
one of the two forms 

as = 04(1 i XQX2) 



or 



as = 04(1 i X1X2), 



where 04 is an arbitrary splitting sequence of length f. 



As the proof shows, the splitting sequences of length 8 having sign 
pattern 

~ h, ~ ~ 

are given by 

04(1 ± X0X2) 

while those having sign patterns 






are given by 



04(1 ± 0:1X2). 



Corollary 27 If as is a splitting sequence of length 8, then 

Og = 8(1 ± X0X2) 
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or 

a§ = 8(1 ± X1X3). 



Consider a splitting sequence a\e of length 16 and write 



016 — Os + OgXs, 

where ag and Og are length 8 splitting sequences. The splitting condition 
on aie implies 

Os + (og)^ = 16. 

By Theorem 26 



agOg = 0. 

Suppose ag has sign pattern 



(27) 



+ - + - . 



By Theorem 26 ag has the form 

ag = 04(1 ± X0X2), 

where 04 is a length 4 splitting sequence having sign pattern 

+ - . 



Consider the case 



cig — 04(1 + X0X2). 

If 014 is given by the directed characters 

Aq Ai, 

then ag is given by the directed characters 

Aq Ai Ao — Ai. 



By (27) 



Og = 04(1 — X0X2). 

There are two cases. Suppose a| has sign pattern 



(28) 



+ - 

and is given by the directed characters 

A2 A3. 
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ag is given by the directed characters 

A 2 A 3 — A 2 A 3 . (29) 

The splitting condition on aie applied to (28) and (29) implies 

— aoo.1 + a.20'3 = 0. 

Since 

a 4«4 = 20002(1 + xo ) + 20103(1 - Xo ), 

we have 

q; 4Q;4 = i4 

and 

ai6 = ai[{l + X 0 X 2 ) ± (1 - xoX2)x3], 

Suppose has sign pattern 

- + . 

Q!g is given by the directed characters 

A 2 A 3 A 2 — A 3 (30) 

and has sign pattern 

- + - + . 

The splitting condition on aio applied to (28) and (30) implies 

— oooi + 02 O 3 = 0 . 



Since 

a 40;4 = 20003(1 + Xo)xi + 20102(1 - xo ) xi , 

we have 

q; 4Q;4 = i4xi 

and 

«16 = «4[(1 + a:oa:^2) ± (1 - xoX2)xiX3]. 
Similar arguments prove the following result. 
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Theorem 28 aie is a splitting sequence of length 16 if and only if a±Q 
has one of the following forms. 

Type 1. ai[{l + xqX 2 ) ± {I - XQX 2 )xf\ 

ai[{l - X0X2) ± (1 + a:oX2)x3] 

Type 2. a 4 [(l + X 0 X 2 ) ± (1 - xoX 2 )xiXs] 

a4[(l - 2 : 0 X 2 ) ± (1 + XoX2)xiXs] 

Type 3. a 4 [(l + X 1 X 2 ) ± (1 - a:iX 2 )x 3 ] 

a^[{l - X 1 X 2 ) ± (1 + a:iX2)x3] 
a^[{l + X 1 X 2 ) + (1 - a:iX2)xoX3] 
ai[{l - X 1 X 2 ) + (1 + a:iX2)xoX3], 

where 0:4 is a length 4 splitting sequence. 

The length 16 splitting sequences in Theorem 28 have been organized 
according to their sign patterns. 

Type 1 ^ ^ ^ h 

+ - + - + - + - 

Type 2 - + - + + - + - 

+ - + h- + 

Type 3 hH hH 

+ h + h. 

Corollary 29 //aie is a length 16 splitting sequence, then afg = 16. 
The condition 

= 16 

is not sufficient for aig to be a splitting sequence. 

A similar argument proves the following result. 

Theorem 30 The splitting sequences of length 32 have one of the fol- 
lowing forms. 

Type 1 ai 6 (l ±xoX 2 X 4 ), where aig is Type 1. 

Type 2 ai 6 (l ±xiX 3 X 4 ), where aig is Type 1. 

Type 3 ai 6 (l ± X 0 X 2 X 4 ), ai 6 (l ±xiX 3 X 4 ), where aig is Type 2. 

Type 4 ai 6 (l ± X 1 X 2 X 4 ), ai 6 (l ±xoX 3 X 4 ), where aig is Type 3. 
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The classification of length 32 splitting sequences into types corre- 
sponds to the sign patterns of the type. 



Type 


1 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 






-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


- 


Type 


2 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


-b 


- 


-b 


- 


-b 


- 


-b 


- 






-b 


- 


-b 


- 


-b 


- 


-b 


- 


- 


-b 


- 


-b 


- 


-b 


- 


-b 


Type 


3 


- 


-b 


- 


-b 


-b 


- 


-b 


- 


- 


-b 


- 


-b 


-b 


- 


-b 


- 






-b 


- 


-b 


- 


- 


-b 


- 


-b 


-b 


- 


-b 


- 


- 


-b 


- 


-b 


Type 


4 


-b 


- 


- 


-b 


-b 


- 


- 


-b 


-b 


- 


- 


-b 


-b 


- 


- 


-b 






— 


-b 


-b 


— 


— 


-b 


-b 


— 


— 


-b 


-b 


— 


— 


-b 


-b 


— 



References 

[1] K.G. Beauchamp. Applications of Walsh and Related Functions. Academic 
Press, London, 1984. 

[2] J. Brillhart and L. Carlitz. Note on the shapiro polynomials. Proc. AMS, 
25:114-118, 1970. 

[3] S.Z. Budisin. New complementary pairs of sequences. Electronics Letters, 
26(13):881-883, 21 June 1990. 

[4] S.Z. Budisin. Efficient pulse compressor for Golay complementary sequences. 
Electronics Letters, 27(3):219-220, 31 January 1991. 

[5] S.Z. Budisin, B.M. Popovic, and L.M. Indjin. Designing radar signals using 
complementary sequences. Proc. lEE Conf. RADAR 87, pages 593-597, Oct. 
1987. 

[6] J.S. Byrnes. Quadrature mirror filters, low crest factor arrays, functions achiev- 
ing optimal uncertainty principle bounds, and complete orthonormal sequences 
— a unffied approach. Applied and Computational Harmonic Analysis, 1:261- 
266, 1994. 

[7] J.S. Byrnes. A low complexity energy spreading transform coder. In Y. Zeevi 
and R. Goifman, editors. Signal and Image Representation in Combined Spaces, 
Haifa, 1997. 

[8] J.S. Byrnes, W. Moran, and B. Saffari. Smooth PONS. The .Journal of Fourier 
Analysis and Applications, 6(6):663-674, 2000. 

[9] J.S. Byrnes, M.A. Ramalho, G.K. Ostheimer, and I. Gertner. Discrete one 
dimensional signal processing method and apparatus using energy spreading 
coding. U.S. Patent number 5,913,186, 1999. 

[10] J.S. Byrnes, B. Saffari, and H.S. Shapiro. Energy spreading and data com- 
pression using the Prometheus orthonormal set. In Proc. 1996 IEEE Signal 
Processing Conf., Loen, Norway, 1996. 

[11] J.A. Davis and J. Jedwab. Peak-to-mean power control in OFDM, Golay comple- 
mentary sequences, and Reed-Muller codes. IEEE Trans. Information Theory, 
45(7):2397-2417, November 1999. 




196 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



[12] M.J.E. Golay. Complementary series. IEEE Trans. Information Theory, 7:82- 
87, April 1961. 

[13] John E. Gray and Soon H. Leong. On a subclass of Welti codes and Hadamard 
matrices. IEEE Trans. Electromagnetic Compatibility, 12(2): 167-170, 1990. 

[14] Hans Dieter Liike. Sets of one and higher dimensional Welti codes and comple- 
mentary codes. IEEE Trans. Aerospace and Electronic Systems, 21(2):170-178, 
March 1985. 

[15] B.M. Popovic. Power efficient multitone signals with flat amplitude spectrum. 
IEEE Trans. Communications, 39(7):1031-1033, July 1991. 

[16] B.M. Popovic. New RACH preambles with low autocorrelation sidelobes and re- 
duced detector complexity. In Proc. of the fth CDMA International Conference, 
Vol. 2, pages 157-161, September 1999. 

[17] B.M. Popovic. Spreading sequences for multicarrier CDMA systems. IEEE 
Trans. Communications, 47(6):918-926, June 1999. 

[18] B.M. Popovic. Spreading sequences for multicarrier CDMA systems. IEEE 
Trans. Communications, 47(6):918-926, June 1999. 

[19] B.M. Popovic, N. Suehiro, and P.Z. Fan. Orthogonal sets of quadriphase se- 
quences with good correlation properties. IEEE Trans. Information Theory, 
48(4):956-959, April 2002. 

[20] W. Rudin. Some theorems on Fourier coefficients. Proc. Amer. Math. Soc, 
10:855-859, 1959. 

[21] H.S. Shapiro. Extremal problems for polynomials and power series. Sc.M. thesis, 
Massachusetts Institute of Technology, 1951. 

[22] M. Taghavi. Upper bounds for the autocorrelation coefHcients of the Rudin- 
Shapiro polynomials. Korean Journal of Comp, and Appl. Math., 4(l):39-46, 
1997. 

[23] R. J. Turyn. Hadamard matrices, Baumert-Hall units, four symbol sequences, 
pulse compression and surface wave-encoding. Journal of Combinatorial Theory, 
16:313-333, 1974. 

[24] S. R. Weller, W. Moran, and J. S. Byrnes. On the use of the PONS sequences 
for peak-to-mean power control in OFDM. In Proc. of the Workshop on Defense 
Applications in Signal Processing, pages 203-209, LaSalle, Illinois, USA, August 
1999. 

[25] G. R. Welti. Quaternary codes for pulsed radar. IRE Trans. Inf. Theory, 6:400- 
408, 1960. 

[26] Roland Wilson and John Richter. Generation and performance of quadraphase 
Welti codes for radar and synchronization of coherent and differentially coherent 
PSK. IEEE Trans. Communications, 27(9):1296-1301, September 1979. 




CLIFFORD ALGEBRAS 
AS UNIFIED LANGUAGE 
FOR IMAGE PROCESSING 
AND PATTERN RECOGNITION 



Valeriy Labunets 
Urals State Tecnical University 
Ekaterinburg, Russia 
lab@rtf.ustu.ru 



How wonderful are the things the Lord does! All who are delighted with them 
want to understand them. 

— Good News Bible, Psalm 111:2 



Abstract The main goal of the paper is to show that commutative hypercomplex 
algebras and Clifford algebras can be used to solve problems of multi- 
color image processing and pattern recognition in a natural and effective 
manner. 



Keywords: Clifford algebra, color images, hyperspectral images, invariants, multi- 
color images, pattern recognition. 

I. Introduction 

The concept of color and multispectral image recognition connects the 
topics we consider in this work. The term “multispectral (multicolor, 
multicomponent) image” is defined for an image with more than one 
component. An RGB image is an example of a color image featuring 
three separate image components R(red), G(green), and B(blue). We 
know that primates and animals with different evolutionary histories 
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have color visual systems of different dimensionality. For example, the 
human brain uses three channel (RGB) images to recognize color (RGB)- 
images. Primates have dichromacy and trichromacy visual systems, and 
they can use various 2D and 3D channels for the same purpose. Non- 
primates have monochromacy and dichromacy visual systems. Images 
of such systems are real — or complex — valued functions. Reptiles have 
multichromacy visual systems. For example, the tortoise visual system 
has five types of color photoreceptors (R,G,B, DG,UV). Shrimps have 
the biggest known dimension of the visual system. They use ten spectral 
types of photoreceptors in their eyes to recognize fine spectral details. 
Our approach to multicolor image processing is to use so-called multiplet 
(multicolor or hypercomplex) numbers [l]-[7] to describe multicolor im- 
ages and to operate directly on multi-channel images as on single-channel 
multiplet-valued images. In the classical approach every multicolor pixel 
(in particular, color pixel) is associated to a point of a kD multicolor vec- 
tor space (to a point of a 3D RGB vector space for color images). In 
our approach, each image multicolor pixel is considered not as a kD vec- 
tor, but as a multiplet (triplet) number, and multicolor (color) space is 
identified with the so-called multiplet (triplet) algebra. Note that both 
these assumptions (vector and hypercomplex natures of multicolor im- 
ages) are only hypotheses. We have no biological evidence in the form 
of experiments that would verify that the brain actually uses any of the 
algebraic properties arising from structures of the vector spaces or the 
multiplet (triplet) algebra. It is our aim to show that the use of Glifford 
algebras fits more naturally to the tasks of multicolor image processing 
and recognition of multicolor patterns than does the use of color vec- 
tor spaces. We give algebraic models of animals’ visual systems using 
different hypercomplex and Glifford algebras. Our hypotheses are 

1 Brains of primates operate with hypercomplex numbers during im- 
age processing and recognition. 

2 Brains use different algebras on two levels (retina and Visual Gor- 
tex) for two general goals: image processing and pattern recog- 
nition, respectively. Multicolor images appear on the retina as 
functions with values in a multiplet kD algebra (/c-cycle algebra) 
where k is the number of image spectral channels. But multicolor 
images in an animals’ Visual Gortex (VG) are functions with values 
in a 2^-D Glifford algebra. 

3 Visual systems of animals with different evolutionary histories use 
different hyper complex algebras for color and multicolor image pro- 
cessing. 
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One of the main and interesting problems of information science is clar- 
ification of how animals’ eyes and brain recognize objects in the real 
world. Practice shows that they successfully cope with this problem 
and recognize objects at different locations, of different views and illu- 
mination, and with different degrees of blurring. But how is it done by 
the brain? How do we see? How do we recognize moving and chang- 
ing objects of the surrounding world? A moving object is fixed in the 
retina as a sequence of different images. As in the famous aphorism 
of Heraclitus, who pointed out that one cannot step into the same 
river twice, we literally never see the same object twice. No individual 
image allows reaching a conclusion about the true shape of the object. 
This means that a set of sequential images appearing in the retina must 
contain a constant “something,” thanks to which we see and recognize 
the object as a whole. This constant “something” is called invariant. 
For example, all letters ‘F’ in Fig. 1 we interpret as the same for dif- 
ferent geometric distortions. This fact means that all geometrically 

= go (F) rn 

A ^ 

Figure 1. Geometrical distorted versa of letter ‘F’ 




distorted letters ‘F’ contain invariant features, which are not changed, 
when the shape of ‘F’ is changed. Our brain can extract these invari- 
ant features. In Fig. 2 we see hyperbolic (non-Euclidean) motions of 
grey- level mice and color fish. All transformed figures are interpreted as 
being the same. This fact means that all figures contain invariant fea- 
tures with respect to hyperbolic motions (and color transformations), 
and our brain can extract these invariant features from images, too. So, 
we see, we live in 3D Euclidean space but our brain can calculate invari- 
ants of images with respect to non-Euclidean transformations. In order 
for an artificial pattern recognition system to perform in the same way 
as any biological visual system, the recognition result should be invari- 
ant with respect to various transformation groups of the patterns such 
as translation, rotation, size variation, and change in illumination and 
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Figure 2. Non-Euclidean motions 



color. The present work describes new methods of image recognition 
based on an algebraic-geometric theory of invariants. In this approach, 
each color or multicolor pixel is considered not as a kD vector, but as a 
kD hypercomplex number {k is the number of image spectral channels). 
Changes in the surrounding world which cause object shape and color 
transformations are treated not as matrix transforms, but as the action 
of some Clifford numbers in physical and perceptual spaces. We shall 
present some clues that Nature gives us about the role and importance 
of computing with hypercomplex numbers. We wish to review some of 
the reasons why such a state of affairs is necessary from a computa- 
tional point of view. One can argue that Nature has also learned to 
utilize (through evolution) properties of hypercomplex numbers. Thus, 
the Visual Cortex might have the ability to operate as a Clifford algebra 
device. We don’t agree with Kronecker that “T/ie Lord created the 
integers and the rest is the work of nianT We think that the Lord knew 
Clifford algebras, and he was the first engineer who used these algebras 
to design animals’ visual systems. 

2. Clifford algebras as models of physical spaces 
2.1 Algebras of physical spaces 

We suppose that a brain calculates some hypercomplex-valued invari- 
ants of an image when recognizing it. Hypercomplex algebras generalize 
the algebras of complex numbers, quaternions and octonions. Of course, 
the algebraic nature of hypercomplex numbers must correspond to the 
spaces with respect to geometrically perceivable properties. For recog- 
nition of 2D, 3D and nD images we turn the spaces R^, and R” into 
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corresponding algebras of hypercomplex numbers. Let “small” nD space 
be spanned by the orthonormal basis of n space hyperimaginary units 
li, i = 1,2, .. . n. We assume 

( +1 for i = l,2,...,p, 

= < -1 loi i=p+l,p + 2,...,p + q, 

[ 0 loT i = p + q + l,p + q + 2, . . . ,p + q + r = n, 

and lilj = —Ijli- Now, we construct the “big” 2 ^^D hypercomplex space 
R^". Let b = (61,62) •• • ,bn) G B2 be an arbitrary n-bit vector, where 
6j G B2 = {0, 1 } and B2 is the nD Boolean algebra. Let us introduce 
l’’ := ■ Then 2 ” elements form a basis of 2 ”D space, i.e., 

for all C G R^ we have C := X^beB" ^blb- If C\,C 2 G R^ , then we can 
define their product C1C2. There are ‘dfi' possibilities for if = +1,0, —1, 
Vi = 1, 2 , . . . , n. Every possibility generates one algebra. Therefore, the 
space R^ with 3 ” rules of the multiplication forms 3 ”^ different 2 *^D 
algebras, which are called the space Clifford algebras [8]. We denote 

these algebras by Vl2«^^’'^’’'^(R|Li, . • • , In), or if Ii, . . . , In, 

p, q, r are fixed. 

Example 1 ITe start with the space R^ and provide it with the algebraic 
frame of algebras of generalized complex numbers: R^ ^ Vl 2 (R) := 
R+R/ = {z = xi+Ix2 I xi,X 2 G R}, where I is a generalized imaginary 
unit. 

■ If P = —1, i.e., I = i, then Vl 2 (R|i) = COM = {xi + ix 2 \ 
X\,X 2 G R;i^ = —1} is the field of complex numbers. 

■ If P = +1, i.e., I = e, then Vl 2 (R|e) = DOU = {xi + ex 2 \ 

xi,X 2 G R; = 1} is the ring of double numbers. 

• If P = 0, i.e., I = e, then Vl 2 (R|e) = DUA = {xi + ex 2 | 

X\,X 2 G R;e^ = 0} is the ring of dual numbers. 

Example 2 Quaternions, as constructed by Hamilton, form the fD al- 
gebra A4 = A,4(R) = A4fiR\l,i,j,k) = R + Ri + Rj + R/c spanned 
by 4 hyperimaginary units l,i,j,k. The identities = kf = — 1 , 

ij = —ji = k are valid for these units. = p = P = 5 G {—1,0, 1} 
can be set. Here, the two latter values (0 and 1) result in non-classical 
quaternions that were proposed by Clifford [8]. Introducing notations 
I, J, K for new hyperimaginary units, we get nine spatial algebras of 
generalized quaternions, A,4(R|1, 1 , J, K) := A4 = R + RI + RJ + RiV 
depending on which of nine possibilities resulting from P G {1,0, —1}, 
P G {1,0,—!} is valid for the two independent hyperimaginary units. 
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Every generalized quaternion q has the unique representation of the form 
q = 9o + <lil + Q. 2 J + q'iK = 5c(q) + Vec(q), where qo, qi,q 2 , qs are real 
numbers and 5c(q) := qo, Vec(q) := qil + q 2 J + qsK are sealar and 
veetor parts of the quaternion q, respeetively. 

We can make be a ranked and Z/2Z-graded algebra. Let r(b) be 
the Hamming weight (= rank) of b, i.e., a functional r : 'Blf — > [0, n — 1] 
defined by r(b) := XlILi ^(t>) = ^(t>) (mod 2) be the grad of 

b. Then A^n can be represented as the ranked and Z/2Z-graded sums 
•^ 2 " ~ ©r=o "^ 2 " ~ ©a=o ’ where the dimension of the 

vector space aI^I equals the binomial coefficient and Ylk=o C!f = 2"'. 

The dimensions of ^ 2 "^ -^ 2 ^^ equal to 2"'“^. The subspaces A^^l 

are spanned by the /c-products of units /q/ij • • • Lq (ii < Z 2 < • • • < 
ifc), i.e., by all basis vectors Ib with r(b) = k. Every element C := 
SbeB" '^blb of A 2 S has the representations: C = + . . . + Cl"'! 

and C = , where G .A^n is the scalar part of the Clifford 

numbers, G A^^l is its vector part, G A-^n is its bi vector part, 
. . . , CW G A, 2 n^ is its n-vector part, and, finally, and are even 
and odd parts of the Clifford number C. If C G A^^ , we put d{C) = I and 
say that I is the degree of C. Multiplication of two Clifford numbers 
of ranks k and s gives the sum of Clifford numbers from \k — s\ to 
p = min(A: + s,2n — k — s) with increment 2, i.e., + 

Q[\k-s\+2] q[p\_ 

2.2 Geometries of physical spaces 

The conjugation operation in maps every Clifford number 

C := coIo+YhhA^ '^blb to the number C = co/o~Z]b 7 ^o '^blb- The algebras 
A^Sp(p,g,r) transformed into 2"'D pseudometric spaces designated by 
or , if the pseudodistance between two Clifford numbers 

A and B is defined by p{A, B) = \J {A — B){A — B). Subspaces of pure 
vector Clifford numbers x\Ii + . . . + Xnin £ Vec^ are nD spaces 

R"" := ■ The pseudometrics constructed in induce 

corresponding pseudometrics in 

Example 3 /n A, 2 (R) we introduee a eonjugation operation whieh maps 
every element z = xi + 1x2 to the element 7 , = x\ — 1x2- Now, the gen- 
eralized eomplex plane is turned into the pseudometrie spaee A. 2 (R) — > 
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g^5p(p,<j,r) defines the pseudodistanee as: 



\/{x 2 - xi^ + {y 2 - yi)2, if z e A 2 (R\i) , 

ifze ^2(R|e), 
ifze ^2(R|e), 



p{zi,Z2) = { y/{x2 - Xi)2 - (y2 - yi)2 
\x2 - a:i|, 



where zi := (xi,X 2 ) = xi +1x2, Z 2 := {yi,y 2 ) = Vi + Iy 2 - So, the plane 
of the elassieal eomplex numbers is the 2D Euelidean spaee 
the double numbers plane is the 2D Minkowskian spaee and 

the dual numbers plane is the 2D Galilean spaee ■ When one 

speaks about all three algebras (or geometries) simultaneously, then the 
eorresponding algebra (or geometry) is that of generalized complex num- 
bers, denoted by (or )■ 

Example 4 InAifiR) we introduee a eonjugation operation whieh maps 
every quaternion q = qq + I q\ + Jq 2 + Kq^ to the element q = qo — 
Iqi — Jq 2 — Kq^. If the pseudodistanee /o(p,q) between two general- 
ized quaternions p and q is defined as the modulus of their differenee 

u = p- q = t + xl + yJ + zK : /j(p, q) = 



(p-q)(p-q) = 

\/uu, then nine spatial algebras ^ 4 (R) are transformed into nine fD 
pseudometrie spaees designed as ^ ^ ^ gf^^d for 

the number of basis veetors with squares 1,-1 and 0, respeetively, and 
p + q + r = n. Thus, the pseudodistanee ean take positive, negative and 
pure imaginary values. There are only 5 different geometries gg{^Ap,<ix ) . 
gwf^(4’0’0\ The subspaees 

of pure veetor-valued generalized quaternions xl + yJ + zK are 3D 
spaees := . The pseudometries introdueed 

in gg{^Ap,<ix) g^iy different pseudometries in : 



I ^ I ^ 2 

/)(Vec{p}, Vec{q}) = |Vec{p - q}| = |Vec{u}| = { ^ ~ y’^ ~ , 



x"^ = \x\ 



They will be denoted by They form 

Euelidean, Minkowskian, and Galilean 3D pseudometrie spaees, respee- 
tively. 

All even Clifford numbers Sq G ^2"^ modulus represent the ro- 

tation group of the corresponding space g'R,f(^^^'‘^’''\ which is called the 
spinor group and is denoted by Spin ("^2" • Generalized complex 
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numbers and quaternions of unit modulus have the forms: eg = = 

cos (/9 + /sin (^, Qo = 6“°“^ = cos</9 + uosin(^, where cos</9 and sin(/3 are 
trigonometric functions in the corresponding nD -geomeiiies, 

is a rotation angle around the vector-valued quaternion uq of unit 
modulus (|uo| = 1, Uq = — Uq). Clifford numbers £q G Spin ^^2" 
with unit modulus have the analogous form £io = e“°‘'’ = cos(/9+uosin(/9G 
Spin foi' the appropriate bivector uq. 

Theorem 1 [8]. All motions in 2D, 3D and nD spaces 
gpiSp{p,q,r)^ represented in the forms: 

z = eozeo -|- w, x' = QoxQq ^ + w, 

where eo = Qo = So = and |eo| = |Qo| = |<?o| = 1- 

U |go|) |Qo|) I'^ol / 1) then the latter transformations form the “small” 

affine groups , re- 

speetively. 




a) 





c) 



Figure 3 . Rotations in a) 2D Euclidean space b) 2D Minkowskian space 

g^Sv(i, 1,0) 2)3 Galilean space 



Using this theorem, we can describe geometric distortions of images in 
the language of Clifford algebras. These distortions will be caused by: 
1) nD translations x — > x -|- w; 2) nD rotations x — > Sq(x + 

3) dilatation: x — > Ax, where A G R^. If /(x) is an initial image and 

A£’owf(q) is its distorted version, then ;,£-pvvf(x) := f ^A<So(x -|- , 

where A is a scale factor, x, w G We suppose that the human 

brain can use the spinor and “small” affine groups for mental rotations 
(see Fig. 3) and motions of images (for example, in a dream), which are 
contained in the brain memory on the so-called “screen of mind.” 
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3. Clifford Algebras as Models of Perceptual 
Multicolor Spaces 

Early in the 19th century YoUNG (1802) proposed that the human 
visual system contains three color mechanisms. This theory was later 
supported by Helmholtz (1852) and became known as the Young- 
Helmholtz theory of color vision. Later, Hering (1878) proposed that 
color vision is mediated by red-green and blue-yellow opponent mecha- 
nisms. Thus, for a time it appeared there were two conflicting theories 
of color vision. As a result of experimental work, it has since become 
recognized that the two theories describe different levels in the visual 
system. In agreement with the Young-Helmholtz theory, there are three 
photoreceptor mechanisms, i.e., cone types, in the human retina, and, 
in accordance with Hering’s theory, there are color-opponent neurons at 
higher levels in the VC. The multicomponent color image appears on the 
retina as the kD vector 





/i(x) 




■ /^5°^^(A,x)Hi(A)dA ■ 


fM?oz(x) = 


/2(x) 


= 


/^5°^^(A,x)H2(A)dA 




. /fc(x) _ 




_ /^5<’'^(A,x)Hfc(A)dA _ 



where 5"’^-^(A,x) is the color spectrum received from the point x of an 
object and Hi (A), H2(A), . . . , Hfc(A) are sensor sensitivity functions. We 
give algebraic models for two levels (retina and VC) of visual systems 
using different hypercomplex Z/fcZ-graded Clifford algebras. 

3.1 Algebraization of the Young-Helmholtz 
model 

3.1.1 The Young-Helmholtz model of color images. We 

shall represent RGB-color images that appear on the human retina as 
triplet-valued functions: f^^f (x) = /r(x) 1 <,oZ + fG{^)^col + 
where hyperimaginary units, = 1. Numbers of the 

form C = xl + yScoi + (ej)^; = 1) were considered by Greaves [10]. 

According to Greaves, these numbers are called the triplet or 3- cycle 
numbers. We shall call them the color numbers. The product of two 
color numbers C\ = oq + aiEcoi + a2£coZ ^2 = ^0 + ^i£coZ + 
given by 

C 1 C 2 = (oq -|- CLiEcol + ®2£coz)(^0 + biEcol + &2£coz) ~ 

{aobo + aib 2 + 0261) +{aibo + 0262 + ao^i)£coZ + (a2^o + oi6i -h a26o)£coZ- 
Thus, the color product is isomorphic to the 3-point cyclic convolution. 
The color conjugate C of a color number C = x + yEcoi + -zEcoZ defined 
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by C = xj- ye col + ze^^i = x + y^coi + zel^i = x + zEcoi + 2/£^- The norm 
\\C\\= CC \s given by \ \C\\^ = CC = (x^ + ?/^ + z^) — {xy + xz + yz). Each 
color number has three modules 

\\C\\i = \x + y + z\, \\C \\2 = '^x‘^ + y'^ + z^ - xy - xz - yz, 

||C ||3 = ^x^ + y^ + z^ - 3xyz 

possessing the properties: ||CiC 2 ||j = ||Ci||j||C 2 ||i, i = 1,2,3 and ||C ||3 = 
I |C| I 2 I |C| |i. Triplets, strictly speaking, do not form a 3D field, but form 
an associative so-called triplet (color) algebra 

Af = Af(K) = A^(n\l, Seal, eld) ■= + ^^coi + R^L- 

Greaves [10] considered a color number x + ye + ze^ (here, e = Ecoi) 




Figure 4- Geometric interpretation of triplet algebra 



as a point of 3D space and gave the following geometric interpretation 
of triplet operations. He constructed a sphere (see Fig. 4) with the 
center at the origin and denoted intersection points of the sphere with 
the coordinate axes X,Y,Z in the positive direction by l,m,n, respec- 
tively. He drew a circle via points l,m,n; the “symmetrical axis” eiu 
was drawn via the center of this circle and the origin; the “symmetrical 
plane” Ec/i is perpendicular to the axis e;„. Then Greaves considered 
rectangular projections of color numbers C on the axis eiu and plane 
Ecfe. Furthermore, he proved that every triplet C can be represented as 
a sum of a real number being depicted by a rectangular projection of 
the triplet on the axis and a complex number being depicted by a 
rectangular projection of the triplet on the plane Ec^. Therefore, the 
color algebra is the direct sum of the real R and complex C fields: 
^ 3 (R, C) = R • -I- C • Ec/i = R © C, where ei^ := (1 + £ + e^)/3, 
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Ec/i := (1 + +ct;|e)/3 are orthogonal “real” and “complex” idem- 

potents = eiu, = Bch, eiuBch = 'E‘cheiu = 0), respectively, and 
iOs := e~ . Therefore, every color number C = x + + ze^ is a lin- 

ear combination C = x + ye^ + ze^ = aiu ■ eiu + Zch ■ Ec/i = {aiu, Zch) 
of the “scalar” aiu^iu and “complex” parts Zch^ch in the idempotent 
basis {e;^i,Ec/i}. We will call the real numbers aiu G R the luminance 
(intensity) numbers, and the complex numbers Zch = b + jc £ C the 
chromaticity numbers. Obviously, aiuGiu := Ceiu = (x -|- ye^ -|- = 

{x+y+z)eiu, Zch^ch ■= CEch = {x+ye^ + ze'^)Ech = {x+ujly + zuj^)Ech, 
where aiu = x + y + z, '^ch = x -|- -|- ziol- In the new representation 

two main arithmetic operations have very simple forms: 

Cl ± C 2 = (aie;„ -I- ziEc/i) ± (a2e;„ -|- Z2Ech) = (ai ± a2)e;« -|- (zi ± Z2)Ech, 

C 1 C 2 (^a^Giu ZiEf.hfia2(^lu T ’^2Ecli) (cIiC12)g;i^-1-(ziZ2)E^/,^, 

or, briefly, Ci ± C2 = (ai,zi) ± (02, Z2) = (ai ± 02, zi ± Z2), C1C2 = 
(ai,zi) • (a2,Z2) = (0102, Z1Z2). For C we have C = x + ye^ ze'^ = 
^lu ■ T ^c/i ■ Rc/i ifliui ^ch)- Therefore, 

\\C\\i = \x + y + z\ = \aiu\, \\C\\l = \x‘^ + y"^ + z"^ - xy - xz - yz\ = \\zch\\‘^, 

l|C||i = \x^ + y^ + z^ - 3 xyz\ = \aiu\\zch\\‘^ . 

The norms ||.||i, ||.||2 are called the luminance and chromaticity norms, 
respectively. We can consider a color image in the two forms: 

y) = fnix, y)lcoi +fG{x, y)scoi + fsix, y)el^i = 

fiu{x, y'jdiuA' fc/i(x, y)Ec/j. 

In the second form we have separated the color image into two terms: the 
luminance (intensity) term fiu{x,y) and the chromacity term fch{x,y) 
(color information). This color transformation is a linear projection (see 
Fig. 4) of the color vector-valued pixel in the color space on the diagonal 
vector eiu '■= (1, 1, 1) = ii? + ic + is and on the 2D plane 7 Tc/i(C) which 
is orthogonal to the diagonal vector eiu '■ T^chiC) Te;„. The vector is 
called the luminance (white) vector and the 2D plane 7 Tc/i(C) is called 
the chromacy plane of RGB-space. The triangle A(iijiGis) || 7 Tc/i(C) 
drawn between three primaries is called the Maxwell triangle. 

The plain 7Tc/i(C) is equipped with the structure of the complex field C. 
Therefore, we can consider an RGB image as a pair of images f^|*(x) = 
(/z«(x),fcfe(x)), where /;„(x) := (ez„|f^^f (x)) = /r(x) -F /g(x) -h /s(x), 
fc/i(x) := fcoz(x) - fiu{-x)eiu, and (/;„(x)|fc/i(x)) = 0. Here, /z„(x) : 
R2 — > R+ is a real- valued image (grey-level image) and fcZi(x) : R^ — > 
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TTchiC) is a complex-valued image. Hence, each pixel is represented both 
by a real number fiu{^) for the luminance and a complex number fc/i(x) 
for the chromatic information. The argument of the complex number 
fc/i(x) is directly equivalent to the traditional definition of hue, and the 
modulus of the complex number is similar to saturation. Changes in 
both the surrounding world and mental space of reality such as intensity, 
color or illumination can be treated in the triplet algebra language as 
the action of some transformation groups in the perceptual color space 
A^°^. Let us consider some of them. 

1 Let ^ = (1, e**^). Then transformation of color images f^®*(x) — > 
^fcoi(x) = (1,6**^) • (/«^,(x),fcfe(x)) = (/i„(x),e*‘^fc/i(x)) changes 
only the hue of the initial image. The set of all such transforma- 
tions forms the hue orthogonal group BIOG(2) := {(l,e*‘^)| G 

C}. 

2 Now let A = (1,A), A > 0. Then transformation of color images 

> Alfcoz(x) = (1,A) • (/i„(x),fcfe(x)) = (/«„(x),Afcfe(x)) 
changes only the saturation of the initial image. The set of all such 
transformations forms the saturation dilation group SDG(2) := 
{(l,a)| aeR+}. 

3 If ^ = (l,Zc/i) = (1, Izc/ile**^), then transformations f^f*(x) — > 

Alfcoz(x) = (1, Izc/ile**^) • (/z„(x),fc/i(x)) = (/j„(x), |zc/i|e*‘^fcfe(x)) 

change both hue and saturation of the initial image. The set 
of all such transformations forms the ehromatie group GG(2) := 

{(l,Zc/i) = I Izc/ile**^ e C}. 

4 If ^ = (aiu,Zch) = (ozn, |zch|e*'^), where a > 0, then transfor- 
mations fc^f(x) — > Alfcoz(x) = (az„, Izc/ile**^) • (/«„(x), fc/i(x)) = 
(azu/zu(x), |zcZi|e*‘^fc/i(x)) change luminance, hue and saturation of 
the initial image. The set of all such transformations forms the 

luminanee- ehromatie group LCG(2) = ^{aiu,'Zch) \ {aiu £ 

(zcZ^GC)}. 

3.1.2 The Young— Helmholtz fc-cycle model of multicolor 
images. We will interpret multicolor images (1) as multiplet-valued 
signals 

fMcoz(x) = /o(x)1mcoZ + /i(x)eifcoZ + • • • + /fc-i(x)e^coZ> X G R”, 
which take values in the multiplet /c-cycle algebra 

j^Mcol _ 'iAlMcol + R-^McoZ + • • • + R-^tfcoZ’ 
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where 1 , , • • • , i^Mcoi ~ multicolor hyperimaginary units. 

We will denote them by l,e^, . . . This algebra is called the mul- 

tiplet (multicolor) algebra. One can show that this algebra is the direct 
sum of the real and complex fields: 






£[ReL| + |][C.Ej 



_ Tlhzi 



where kiu = 1,2 and kch = f , if k is odd or even, respectively, and 
e\^ and are orthogonal idempotent units such that 
eiej = ejei, = E^^E*^, and ej^E)^ = E(^ej^ = 0, 

for all i,j. Every multiplet C can be represented as a linear combination 
of kiu “scalar” parts and kch “complex” parts: 

klu kch 

C = ^{ai • e\^) + ■ E(,^). 

i=l j=l 

The real numbers a* G R are called the multi-intensity numbers and 
complex numbers Zj = 6 + ic G C are called the multi- chromacity num- 
bers. Now we will interpret the multicolor nD image appearing on the 
nD retina as a multiplet-valued nD signal of the form: 

klu ^ch 

fMcoi(x) = 2[//„(x) • elJ + ^[/4(x) • E(f) = 



(/ii(x), . . . , ft" (x); /4(x), . . . , ■ 

Here, the argument x belongs to the nD vector part of the space 

algebra A^S. Changes in both the surrounding world and mental spaces 
of reality, such as multi-intensity and multi-color, can be treated in the 
language of multiplet algebra as the action of the following transforma- 
tion group in the perceptual multicolor space Letting 






’ • • • ’ \^ch 



A<:h\J‘Pk, 



the following transformations of a multicolor image 

fMcoi(x) > ^fMco^x) = 
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change multi-luminancies, multi- hues and multi-saturations of the initial 
multicolor image. The set of all such transformations forms the multi- 
luminance/ multi- chromatic group MLCG(A:) := , a 



Nch \ 
^Ch 



hv 
lu 1 ^ch’ 



A 



‘‘lu 



G R+, 



Ah 



gC}. 



3.1.3 Multiorthounitary transforms of mnlticolor images. 

A 2D discrete multicolor image can be defined as a 2D array fjdcoZ ~ 




1 ^ 
- 



, i.e., as a 2D discrete 



.A^“^-valued function of one of 



the following forms: 



: Z% 



aMcoI 






R 



hv 






Here, every color pixel at position (i,j) is a multicolor number 

of the type = /o(b j01mco« + /i(b J>mcoZ + - • - + /fc-i(b J>mco« 

or of the type f^^i{i,j)) = Ei=i[fLihj) ' ejj + ' Kh\ = 

[fluihj), ■■■, j); fchihj), ■■■, fctihj)) ■ III particular, for color 

images, we have f^^f (i, j) = fr{ij)+fg{ij)s^+fb{hj)^^ andf^^f(i,j) = 
fiuih j)^iu + ^ch{h j)^ch- The Set of all such images forms N^D Greaves- 

Hilbert space . The vector space structure of this space is de- 

fined with multiplication by triplet-valued scalars {Cf){i,j) := Cf{i,j). 

Our A^^D Greaves-Hilbert space over can be inter- 
preted as kN'^T) Hilbert space over R, i.e., as = R-^^l -|- 

R^"e^ + . . . + or as a direct sum of A^^D real and complex 

Hilbert spaces 




2=1 








^lu 

[® 





In particular, for color images (A,§°^)^ = R'^^ -|- or 

(A,§°^)’^ = R'^^ © where R^^, R'^^e^, are real Al^ Hilbert 

spaces of red, green, and blue images, respectively, R^^ is A^^D real 
space of gray-level images, and is A^^D complex space of chromatic 

images. Let (A.^'^°*) be a Greaves-Hilbert space over 
We say that the operator 2,2D '■ ^ , ^2d[^^coi\ = 

F^loi is multicolor linear if and only if for all ^^coV^^coi ^ 
and for all C G .Af“^ + SmIozI = 

^ 2 d[C^m^J = C£ 2 D[fMco«]. Otherwise, we call £ 2 D multicolor nonlin- 
ear. If £, 2 d is a multicolor linear operator and ||det(T 2 D)|l 2 does not 
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vanish, then we call operator £20 nonsingular. Otherwise, we call £20 
singular. The inverse of an operator £20 exists if and only if £21) 
is nonsingular, is calculated in the same way as an ordinary in- 

verse matrix. The set of nonsingular multicolor linear operators form 
the general multieolor linear groups over multiplet algebra and is 

denoted by G'L{N,A^‘^°’'). It is then easy to dehne the adjoint opera- 
tor £2^ for the color linear operator £20 whose essential properties are 
i^Mcoi\^2DgTcoi) = i^WMiilsTcoi)^ and for i£ 2D2^2D the property 
( 1^20 2 - 220 )* = 2^2D^^2D 

Definition 1 A multieolor linear operator £20 on is said 

to he multi-orthounitary if = £20- 

Multi-orthounitary operators form the multi-orthounitary group MOU 
= This group is isomorphic to the direct sum of kiu or- 
thogonal and kch unitary groups, = 

klu kch ^lu ^ch 

[ 0 O(R)eL] + [ 0 U(C)E^^J = [ 0 O(R)] © [ 0 U(C)‘ . 

i=l j=l i=l j=l 

In particular, the orthounitary group of transformations for color images 
has the decomposition: OU(.A3°^) = 0 (R)e;„ + U(C)Ec/i. Every element 
of MOU(.A^“^) and OU(^3°^) has the representation: 

^lu ^ch 

^2D^\u + -22D = 02Z)e;„ + Z^2 dEc/i, (2) 

i=l j=l 

where G 0 *(R), O2D G 0 (R), and G U-^(C) U2D G U(C) are 
orthogonal and unitary transforms, respectively. For multicolor image 
processing we shall use separable 2D transforms. 

Definition 2 ITe eall the multi- orthounitary transform ^2D[fM^coi\ 
arable if it ean be represented by l^2D[fM^coi\ ~ he., 

^2D = - 2 id © is the tensor produet of two ID multi- orthounitary 
transforms of the form: 

klu ^ch 

£2D = £id © = [ 0 { 0 \ © O* ) e^] + [ 0 (ui © Ui) (3) 

i=l j=l 

for multieolor images, and the tensor produet of two ID orthounitary 
transforms £20 = -2id © -2id = (Oi © 02)6;^ + {U\ © ^2)Ec/i for eolor 
images, where 0 ^, 0 ^ andUl,U2 are ID orthogonal and unitary trans- 
forms, respeetively. 
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Figure 5. Color Walsh-Fourier 2D5 = We;„ + EEch, Hartley-Fourier ~ 

Hteiu + EEch and Haar-Fourier = Hrei„ + EEch transforms, respectively 



In particular, we can obtain any orthounitary transform, using any two 
pairs of orthogonal Oi, O2 and unitary transforms U\,U2- It is possible to 
use one pair of orthogonal and unitary transforms, when Oi = O2 = O 
and IA\ = IA2 = U ■ this case we obtain a wide family of orthounitary 

transforms of the form: H2D = (O (g) 0 )eiu + {U 0 U)'Ech, using differ- 
ent ID orthogonal transforms. The following table shows some of the 
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Hll 


T 


CW 


CTV 


CH 


w 


Wciu -1- ZGEcji 


Weiu+CWE,h 


We;„ + CTVE^h 


Cl~CEich 


Hd 


Hdeju + T'Eich 


Hde/ii + CWEch 


Udeiu + CTVEch 


Hdeju + CHEch 


Ht 


HtCiu + tFUjch 


Uteiu + CWEch 


Hteju + CiFVEch 


Uteiu + CT^Ech. 


Hr 




Hre/ii + CWEch, 


Hreiu + CTPEch 


Hre/ii + CT^Ec/i 


Wv 


Wveiu + TEch 


Wvei„+CWEc. 


Wve;„ +CTPEch 


+ Cl~C¥jch 



where W, Hd, Ht, Hr, Wv are Walsh, Hadamard, Hartley, Haar, 
and Wavelet orthogonal transforms, and fF, CW, CFV^ and CTi are 
Fourier, complex Walsh, complex Fourier-Prometheus, complex Haar 
transforms. Every pair (O, U) of an orthogonal O and a unitary Z// trans- 
form generates an orthounitary (color) transform £ = Oeiu+WEich- Some 
examples of basis color functions of color transforms 2 B 5 = We;„-|-.FEch, 
= Hte;„ -I- FBch, = Hre/tj -|- FBch are shown in Fig. 5. 

3.2 Algebraization of the Hering model 

3.2.1 The Hering Z/2Z-graded model of color images. Let 

us consider 3D color space RJr + RJg+ spanned by three 

new hyperimaginary units Ji (i = 1,2,3 or i = R,G,B) : J\ = Jr 

(red unit), J 2 = Jg (green unit), J 3 = Jr (blue unit). We assume 
= Or, Jq = ttg, = Ofe, where ar,ag,ah G {-|-1,0, — 1}. We assume 
J“l = -|-1 for i = 1 , . . . , a, Jf = —1 for i = a . ,a + (3, Jf = 0 

for i = a -|- /3 -|- 1, . . . , a -|- /3 -|- 7 , where a + (3 + = 2>. Further, we 

construct a new color Clifford algebra Jg, Jr) = R<^sz + 

(RJB-|-RJG + R<^B) + (R<^RG + R«^BB+R</GB)+R<^Wh) where Jbi = 1, 
Jwh = JrJgJb are black and white units, Irg := Jr Jg, Jrb ■= Jr Jb, 
Jgb ■= JgJb- This is an algebra of generalized color octonions with 
signature (a,/3, 7 ). 

Definition 3 The functions ^ Jg, Jb) of 

the form: 

fcoF(x) = fBi{^)JfD + (^fR{x)jR + /g(x)Jg -h /b(x)Jb)-H 

+ (/bg(x) Jrg + fRB(^)JRB + fGB{^)JGB^ + fwh{^)Jwh 

are called the .A^F^'^’^’^’^^-valued color nD images appearing in the human 
Visual Cortex. 

The second opponent cells map R,G,B components on the 4D unit sphere 
fBi+fRG+fRB+fcB = where fRG, fRS, fGB, fBi are black, red-green, 
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red-blue and green-blue components, respectively. Therefore, resulting 
from the capacity of this algebraic model of color image, we can formu- 
late the spin-valued function 

fcoZ^(x) := + (^/rg(x)Jrg + fRB{^)JRB + fGB{^)JGB^ 

which has values in the spin part Spin^^g°^^( J/j, Jc, of the color 

Clifford algebra. 

Definition 4 The functions f^p : — > Spin(^A'^\J r, Jq, Jb)^ , and 

fp^p : R"" — > Spin(^A'^\j R, Jq, Jb)^ are called the spin-valued color 2D 
and nD images, respectively. 



For this model, color changes in both the surrounding world and men- 
tal spaces can be described as the action of the color spinor trans- 
formation group in the perceptual color spaces and Spin^^g°^^ 



Co G Spin^^|°^^, then the transformations f^p(? 



^^oF(^) — ^ ^ and f^^p(x) — > Cof^^p(x)Co ^ are called the left, 

right and two-sided spinor-color transformations of f^^p (x) , respectively. 



3.2.2 The Hering Z/fcZ-graded model of multicolor images. 

To form the algebraic model of multicolor images in the animals’ VC, 
we consider a k^D multicolor quantum Clifford algebra 



QQj^Mcol{ao,ai,...,ak-i) 



(Ti , . . . , Jm) 



(i.e., Clifford algebra with signature (ao> ^ «A:-i) and deformed 

by a A: primitive root of unity iVk) generated by multicolor hypercom- 
plex units Ji, . . . , Jm with relations Jp = uj^ for oq hypercomplex units, 
Jp = ijj^ for a\ hypercomplex units, . . . , Jp = uj^~^ for Ofc-i hyper- 
complex units, and JjJi = uokJiJj, if i < j, where oq + + • • • + 

Ofc-i = m, (j := [9]. The elements J® = " Jfif form a ba- 
sis of . . . , Jm), where s = (si, . . . , G B^, 

Si G Bfe = {0, . . . , A — 1}. We shall denote this algebra by QCA^m°\ if 
Ji, . . . , Jm, and (oo) oi, . . . , Ofc-i) are fixed. In the particular case when 
k = 2 {ujk = — 1), m = 3 and oq = a, a\ = f3, 02 = 7 , the k^D quan- 
tum Clifford algebra . . . , Jm) is the generalized 

color octonion algebra We can make QCA^rf°^ be a ranked 

and Z/kZ-graded algebra. Let r(s) be the Hamming weight of s, i.e., 

a functional r : Bp — > [0, m — 1] defined by r(b) := YlF=i [l ~ > 
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and let d{s) = Si (mod k) be the grad of s. We set QCA^^jn = 

SPAn|j®| r(s) = r|, obtaining the ranked sum 

m 

QCAfrfi°^ = 0 QCA'I. 

r=0 

By setting QCA^m = SPAn|j®| 5(s) = I mod fc|, we get the 7i/k7i- 
graded R-algebra QCA^A°^ = 0 QCA].™. We put d{C) = I and say 

i=0 

that I is the degree of C. . We say that <i>g and are graded and 
ranked automorphisms if ^g(^QCA]^l^ = QCA]^1 and = 

QCA^^jn for all / = 0, 1, . . . , A: — 1, and r = 0, 1, . . . , m, respectively. Let 
qca£1 = span{j ® I r(s) = l| be the vector part of the quantum 
Clifford algebra QCA^A°^. Then the set of automorphisms of unit mod- 
ulus = QCJ^^jn forms a transformation group of QCA^.™. 

We shall call it the quantum spinor group and denote it by 

QSpin^QCA^™'). 

Definition 5 The funetions 

are ealled the nD Cliffordean-valued and quantum-spin-valued images 
appearing in the animals’ VC. 

For this model, multicolor changes in both the surrounding world and 
mental spaces can be described as the action of the quantum spinor 
transformation group in the perceptual color spaces QSpin^QCA^^°^^ 

and QCA^m°^ . If Cq G QSpin^QCA^^°*^ , then the transformations 

fMcoi(x) — > Cof]^^oj(x), 

fM?oz(x) > fMco«(x)C(7^ and fMeoz(x) — > 

are called the left, right and two-sided spinor-color transformations of 
respectively. Further, we interpret an image as an embed- 
ding of a manifold in a spatial-multicolor Clifford algebra of higher di- 
mension. The embedding manifold is a “hybrid” (spatial-multicolor) 
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space that includes spatial coordinates as well as color coordinates. For 
example, a 2D color image is considered as a 3D manifold in the 5D 
spatial-color space / 2 ; Jr,Jg,Jb) = (RA + R-^ 2 ) © (RJr + 

RJg + RJb) = R^p ® R-coi’ whose coordinates are {x,y, /r, fa, fs), 
where x G R/i, y G R/2 are spatial coordinates and Jr G RJr, 
fa G RJg) /b £ R«/b, are color coordinates. It is clear that the ge- 
ometrical, color and spatial-multicolor spaces R^p, R^p^coZ 

generate spatial, color and quantum spatial-multicolor Clifford algebras 



,Sp{p,q,r) 



Q^j^Mcol{ao,ai,...,ak-i) 



respec- 



tively. Here, all spatial hyperimaginary units commute with all multi- 
color units. 



3.2.3 ClifFord-unitary transforms of multicolor images. 

2D discrete Cliffordean-valued images appearing in the animals’ VC can 
be defined as a 2D array := Here, every pixel 

position (i,j) is a quantum Clifford number of the type 



nvicpi 

rVC 



'-Mcolihj) = E /s(bj)J" 
sGBT" 



The set of all such images forms the 



Clifford-Hilbert space • ^he vector 

structure of this space is defined with multiplication by Clifford-valued 
scalars {Cf){i,j) := Cf{i,j). We say that the operator 



/ \ iV^ / \ 

£ 2 d: [QCAffr^) (QCA^At°'-) , i.e., 2,2dK^coi] = ^\Fcoi 

is Clifford linear if and only if for all ^Xicoi^^iFcoi ^ and 

for all Ci,C 2 ,Hi,H 2 G QCA^ft°^ 

^ 2 D + ^ 2 g^ol^ 2 ] = Ci£, 2 d[^M^oi]^^ + C2-C2D [gM^oz]H 2 • 



Otherwise we call £, 2 d Clifford nonlinear. If 2,20 is a Clifford linear 
operator and | |det(£2D)| li does not vanish, then we call the opera- 
tor 220 nonsingular. Otherwise 220 is singular. The inverse 2f^ of 
the operator 220 exists if and only if 220 is nonsingular. The set of 
nonsingular Clifford linear operators forms the general Clifford linear 
groups over the quantum Clifford algebra QCA^A°^ and is denoted by 

GCL^iV, . It is then easy to define the adjoint operatorT^^) 

for the Clifford linear operator T 2 D) whose essential properties are 



{^M^ol\^20%Miol) — {^*20^Mcol\%Mcol) J 



MC 



VC \„VC 



and for the product 1 T 2 D 2^20 we have 

{ 1^20 2 ^ 20 )* = 2^*20 1^20- 
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'^2D ~ '^2D- 

Clifford-unitary operators form the Clifford-unitary multicolor group 
Cu(iV, . For multicolor image processing we shall use separa- 

ble 2D transforms. The Clifford-unitary transform ^2d[^Xj^oi\ ~ '^XFcoi 
is called separable if it can be represented by = £, 2 D ~ 

-CiD [fMraz] i-6-) ^ 2 D = -CiD SDIid is the tensor product of two 

ID Clifford-unitary transforms. 

4. Hypercomplex-valued invariants of nD 
multicolor images 

4.1 Clifford- valued invariants 

Let us assume that fMcoz(x) : is an image of a multi- 

color nD object. It appears on the nD eye retina as a function 
of space variables x G Vec^(.42™) = with values in the mul- 
ticolor algebra : fMcoz(^) • This image 

can be considered in the VC as a function of the same space 

variables x, but with values in the multicolor quantum Clifford algebra 
QCAfA°\ i.e., as f]^^„^(x) : QCAfA°K We shall denote 

both algebras A^^°^ and QCA^A°^ by A^^K Changes in the surround- 
ing world can be treated in the language of the spatial-multicolor al- 
gebra as an action of two groups: the space affine group Aff {giZff^'’^) 
acting on the physical space Vec^(.42n) = and the multi-color 

group MLCG(/c) acting on A^^°^ (if = A^^^) or the quantum 

spin group Spin(QC.4^“0 on (if = QCAfA°^). We 

shall denote both groups Spin(QC.A^m“^) and MLCG(A:) by AiCG'R-^'^°^ ■ 

Let x MCGTl^^°^ be a spatial-multicolor 

group, and (g'^^,g^“^) G QSpMcol^ where g*^^ G Aff{G'R-n‘^’^), G 

M.CGTl-^^°^ ■ If X G is a generalized space Clifford number and 

C G A^^°^ is a multicolor quantum Clifford number, then all products 
of the form xC are called spatial-color numbers. They form a space- 
color algebra := A^S <8> A^^°K Here, we assume that all spatial 

hyper imaginary units commute with all color units. 

Definition 6 The A^^^^°Gvalued functional J = AI [fA^coz(x)] of the 
image fMcoz(x) is called the relative G^^^^°Ginvariant if 

J = M O fMcoz(g^^ o x)| = C • Al{fMcoz(x)} • C“\ 
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Vg G QSpMcoi^ where C,C ^ are left and right A^^^^^^-valued multipliers. 
If C = 1 thenj is ealled the absolute invariant and denoted by X. 

Let c be the centroid of the image {mcoi{'^)- 
Definition 7 The funetionals 

Mp := M{fMcoij = f A^~ c)^fMco«(x)dx 

are ealled the eentral -valued moments of the nD image ImcoA): 

where p £ Q are rational numbers. 

Let us clarify the rules of moment transformations with respect to dis- 
tortions of color and geometry of the initial images. If iMcoii^) is the 

initial image, then fA,£-o,w(^*) =Co[^Mcoi {X£o{x-\-w)£q ^)|Co ^ denotes 

its .A^“^-multicolor and ^77^’'^’^-geometrical distorted copy. Here v,w 
are nD vectors. Summing v with w brings us to image translation by 
the vector w, two-sided multiplications A£io(x-|-w)iS|^^ by X£o and 
equivalent to both an nD rotation of the vector z -|- w and a dilata- 
tion given by the factor A. Here, ImcoI — > C-o^McoiCq^ is a multicolor 
transformation of the initial image. 

Theorem 2 The eentral moments AAp of the multieolor images iMcx>iA) 
are relative -valued invariants 

Ai>^£ovf^Mcol} ~-^pi>'£o'V^^Mcol} ~ (^X^^ CQ£^Aip{lMcol\ ^^0 ) 

( 4 ) 

with respeet to the spatial- multieolor group yjHh both 

valued left multipliers CqX^^^£^ and A^S-valued right multipliers £A ^ 
respeetively (see Fig. 6), and the normalized eentral moments 

Ap{^£ovf^Mcol}\~ l-^p{A£’owfA^coz}-^0 i^£ovf^Mcol}\ j M cod\ 

are absolute sealar-valued invariants, with respeet to the same group, 
i.e.. Ip = = \ffp{lMcoi}\- 

Now we consider the most important cases for invariants of grey-level, 
color and multicolor 2D and 3D images. 

4.2 Complex and quaternion invariants of 2D 
and 3D grey-level images 

Let = {z = X ly \ x,y £ H; P = —1,0, -1-1} be a generalized 
spatial complex plane. Then a grey-level 2D image /(x, y) can be consid- 
ered as a function of a generalized complex variable, i.e., f{x, y) = /(z), 
where z = x + ly £ Let c be the centroid of the image /(z). 




Clifford Algebras as a Unified Language 

C-0 

fmcoi(^) ^ A~ w)iSg )Cq 



219 



M 



X£o{^ + ^)£o 



-1 



{ fmcoi (^) } 



AP+”Co^o^{-}VCo“' 



M 



\P+^Co ( £^Mp{fmcoi{^)}£onCr^ 



Figure 6. Transformations of ^®^*^‘^°*-valued moments with respect to the spatial- 
multicolor group 



Definition 8 Functionals of the form 

mp{/} = [ (z - c)Pf{z)dz 

are called the one-index central valued moments of the image /(z), 
where dz := dxdy, and p £ Q are rational numbers. 

Let us clarify the rules of Alf^-valued moment transformations under 
geometrical distortions of the initial 2D images. We will consider trans- 
lation, rotation and scaling transformations. If /(z) is the initial image, 
then /v,w(z) = /(v(z -|- w)) denotes its geometrical distorted copy. 

Theorem 3 The central moments rhp{/} of the image /(z) are relative 
A 2 ^ -valued invariants 

J"p{/v,w} := rhp{/v,w} = vP|v|^rhp{/} 

with respect to the affine group AfF(^C 2 ^) with -valued multipli- 
ers (see Fig. 7), and the normalized central A^^- 

valued moments Afp{fv,w} : = rhp{/v,w}riio~^{/v,w}/riii{/v,w} are ab- 
solute A 2 ^ -valued invariants with respect to the same group, i.e., Tp = 

AAp{/v,w}=AAp{/}. 

Let us consider a 3D grey-level image f{x, y, z). This image can be con- 
sidered as a function of a generalized quaternion variable q = {xl -\-yJ -\- 
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v(z + w) 

/(z) ^ /(v(z + w)) 



M 



mp{/(z)} 



vPIvP 



M 



vP|vprhp{/(z)} 



Figure 7. Transformations of ^f^-valued moments with respect to the affine group 

Aff(gcf^’) 



zK), i.e., f{x,y,z) = /(q), where q G Vec{^ 4 ^} = QTZ^. Let c be the 
centroid of the image /(q). 

Definition 9 Functionals of the form 

■Mp{f} := [ (q-c)/(q)dq 

are called the one-index central ^f^-valued (quaternion-valued) mo- 
ments of the grey-level 3D image /(q), where p £ Q, and dq := dxdydz. 



Let us clarify the rules of moment transformations with respect to geo- 
metrical distortions of 3D images. These distortions will be caused by 
1) 3D translations q — > q -|- w, 2) 3D rotations q — > Qo(q + 
where Qo = 3) dilatation: q — > Aq, where A G R+. If /(q) is 

an initial image and /AQow(q) its distorted version, then /AQow(q) := 
/(AQo(q + w)Qo^). 

Theorem 4 The central moments AAp{f} of the 3D grey-level image 
/(q) are relative -valued invariants 

JpihQo^} :=A^p{/aSow} = 

with respect to the group Afi{QC^^) with left -valued multipliers 
AP+3qp right A^^ -valued multipliers Qq^, respectively (see Fig. 

8), and the absolute values of normalized central moments 



\Mp\ = Mp{fxQo^{q)}Ml ^{fxQo^ 



(q)} 



-Mr'{/AQ, 



OW 



(q)} 
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/(AQo(q + w)Q| 



M 



-u 



Ap+3QP{.}Q/ 

Mp{f(q)} ^ AP+3Q(;^Mp{/(q)}Q/ 



Figure 8. Transformations of ^f^-valued moments with respect to the group 

Afr(gcf^) 



are absolute sealar-valued invariants with respeet to the same group, i.e., 

^P = Wp{hQ.^)\ = Wv{f}\- 

4.3 Moments and invariants of color 2D and 3D 
images 

Let QC 2 ^ := {z = x\ +1x2 \ x\,X 2 G R; /^ = — 1,0,+1} be a gen- 
eralized spatial complex plane. Then the color image fcoi{x,y) can be 
considered as a triplet-valued function of the generalized complex vari- 
able z = xi + 1x2, he., as fcoi{x,y) = fcoi(z), where z G . Let 
z G QC 2 ^ be spatial and A G be color triplet numbers. Then all 
products Z.4 will be called spatial-eolor numbers (or Hurwitz numbers). 
They form the 6D spaee-eolor Hurwitz algebra Aq^^°'‘ := A^^ (8> .4^°^ 
We assume that all spatial hyperimaginary units commute with all eolor 
units. Therefore, = A^^ (8> A^°’' = A^^^ (8> A^^. Let c be the 

centroid of the image fcoi(z)- 

Definition 10 Funetionals of the form 

Mp = [ (z - c)Pfcoi{z)dz 

are ealled the eentral -valued moments of the eolor image fcoi(z). 

Let A = (aiu,z,ch) G A^'°''. Let us clarify the rules of moment transfor- 
mations with respect to distortions of color and geometry of the initial 
images. If fcoi(z) is the initial image, then v,wf^;(2:) = .4fcoz(v(z-|-w)) = 




222 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



oz«f«n(v(z + w)) • eiu + Zcfcfc/i(v(z + w)) • Ech denotes its LCG(3)-color 
and AfF(^C 2 ^)-geometrical distorted copy. 

Theorem 5 The central moments A4p of the color image fcoz(z) are 
relative -valued invariants 

:7p{v,wf,^z} := = A^rP\^r\^Mp{^col} (5) 



with respect to the spatial-color group AfF(^C 2 ^) xLCG(3) with 
valued multipliers AvP|vp (see Fig. 9), and the absolute values of the 
normalized central moments 



Wv 




v.wf^zl 



are absolute scalar-valued invariants, with respect to the same group, 



Z. C. j — 




Mp{fcol}\- 



fcoz(z) 



A 



v(z + w) 



Afcoz(v(z + W)) 



M 



M 



+ AvP|v|2 



AvP|vpMp{fcoz(z)} 



Figure 9. Transformations of relative Tg^'^°*-valued moments with respect to the 
spatial-color group X LCG(3) 



Let fcoz(q) be a color 3D image depending on the pure vector generalized 
quaternion variable q G Vec{Af^} = QTZ^. If q G CTZ^ C is a 
generalized quaternion and A G A^°* is a color number, then all products 
of the form qA are called spatial-color quaternions. They form a space- 
color algebra := Af^ G A^°* = A^°^ G Af^. 

Definition 11 Functionals of the form 

Mp{icoi} ■= (q - c)fcoz(q)dq 

dqee7^3 
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are called the one-index central ^'^^‘^“^-valued moments of the color 3D 
image fcoz(q), where p £ Q, and dq := dxdydz. 

Let us clarify the rules of moment transformations with respect to ge- 
ometrical and color distortions of 3D color images. If /coz(q) is an 
initial image and AQwf^z(q) its distorted version, then AQwf^z(q) := 

fcoz(q) ^ ^fcoz(AQ(q-h w)Q-i) 

AQo(q + w)QQ^ I 



M 



a?’+3q(;^{-}q/ + 



Figure 10. Transformations of ^'®^*^°Lvalued moments with respect to the spatial- 
color group Affl^Cg^) X LCG(3) 

Theorem 6 The central moments A4p{fcoi} of the 3-dimensional color 
image fcoii^l) are relative -valued invariants 

F[Ua„} :=.Wp{fie.w} = (.4A»+3Qg)M,{f„,}(Q-») 

with respect to the group x LCG(3) with left -valued 

multipliers AX^^^Q^ and with right Spin{A^^) -valued multipliers , 
respectively (see Fig. 10), and the absolute values of the normalized 
central moments 

|-A/’p{A£-owfco°z}l = -^p{A£’owfco°}-^0 {A£’owfco°zi / -^li^Sovf^col} 

are relative scalar-valued invariants, with respect to the same group, 
i.e., Tp = lA/^lA^’owfcozll = |-A/p{fcoz}|- 

5. Conclusions 

We have shown how Clifford algebras can be used in the formation 
and computation of invariants of 2D, 3D and nD color and multicolor 
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objects of different Euclidean and non-Euclidean geometries. The theo- 
rems stated show how simple and efficient the methods of calculation of 
invariants are that use spatial and color Clifford algebras. But how fast 
can the invariants be calculated? The answer to this question the inter- 
ested reader can find in [2] , where Eourier-Clifford and Eourier-Hamilton 
number theoretic transforms are used for this purpose. 
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Abstract 

The Cooley- Tukey FFT can be interpreted as an algorithm for the 
efficient computation of the Fourier transform for finite cyclic groups, a 
compact group (the circle), or the non-compact group of the real line. 
These are all commutative instances of a “Group FFT.” We give a brief 
survey of some recent progress made in the direction of noncommutative 
generalizations and their applications. 



Keywords: Fast Fourier transform, discrete Fourier transform, sampling, Gel’fand- 
Tsetlin bases. 

I. Introduction 

The Fast Fourier Transform or FFT is an efficient algorithm to com- 
pute the Discrete Fourier Transform {DFT). This is a linear transfor- 
mation, specifically realized in terms of the the (n x n) DFT matrix: 

V / jk 

which takes a vector of samples realized as a function / G and 
returns a collection of Fourier coefficients / G 

The DFT plays a crucial role in a wide range of applied activities, 
principally in the analysis of time series data. These natural quan- 
tifications of temporal phenomena presumably owe their origins to the 
observations of the first priestly mathematicians who made it their work 
to chart the course of heavenly bodies in the construction of the first 
calendars. Other early examples would include the analysis of the time 
course of temperatures, rainfall, and various meteorological data, pos- 
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sibly in the service of the study of the time series of agriculture (crop 
yields, etc.). Closer to our hearts and heads are the time series of health 
that are our EKGs and EEGs, which are perhaps, in turn, influenced 
by the seemingly random walk that are the time series which reflect the 
health of the financial markets. 

One approach to time series analysis is to view these phenomena 
as well-explained as the superposition of basic periodic phenomena: 
weather as a combination of diurnal and annual effects, blood pressure or 
hormone levels tracking some invisible and evolved pacemaker. Herein 
the DET is that transformation of the data that teases out the period- 
icities, taking the discrete or discretized signal and transforming it to a 
representation in terms of weighted frequencies. 

Direct calculation of a DET of length n is effected by the multiplica- 
tion of the n X n DET matrix with a vector of length n and so requires 
operations. When n is large even this quadratic calculation is too 
large. The great success of the EET is the reduction in complexity to 
0 (nlog 2 u) operations (with implicit small constant). 

1.1 Brief History of the Classical FFT 

The original EET was indeed due to Gauss, and has the astronomi- 
cal (although not religious) origins indicated above. Its story provides 
another proof that necessity is indeed often the mother of invention. 
Gauss was confronted with the computationally daunting problem of 
interpolating — by hand! — the periodic orbit of the asteroid Geres, which 
had suddenly gone missing. Gauss determined a means of building the 
interpolation on n points from two interpolations of n/2 points, and in 
so doing discovered the basic step in what is now the standard divide- 
and-conquer efficient algorithm. His discovery languished for centuries 
(cloaked in Latin and hidden away in little known writings) while a 
few centuries later it was rediscovered by (among others) Danielson and 
Lanczos in the service of crystallography but, surely most famously, by 
Gooley and Tukey [12] in the mid 1960s, this time not in the service of 
the discovery of missing heavenly bodies, but instead for the detection 
of hidden nuclear tests in the Soviet Union, as well as stealthy Soviet 
nuclear submarines. Eor full histories see [22]. 

The technical motivations for Gooley and Tukey’s rediscovery were 

(1) The efficient computation of the power spectrum of time series 
(especially sampled time series of very long length, so equivalently, 
the calculation of high frequency contributions) and 



(2) Efficient filtering (smoothing). 
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Item (2) is equivalent to the efficient computation of (cyclic) convo- 
lution. For vectors / and g of length n this is defined as 

n— 1 

f-kg{x) = ^f{x -y)g{y) 

y=0 

where the arguments are interpreted as integers mod n. {Linear convo- 
lution is that which corresponds to the generation of a vector of length 
2n from / by interpreting the samples f{i),g{k) as coefficients in a poly- 
nomial of degree n — 1, and then asking for the coefficient of x^ in the 
product. Note that this could be obtained by computing cyclic convolu- 
tion of / and g zero-padded to vectors of length 2n.) 

Note that direct computation of the convolution requires opera- 
tions. The identity 

f * g{k) = f{k)g{k) (2) 

shows how application of the DFT permits the filtering of / to be per- 
formed directly in the frequency domain via the assignation of a par- 
ticular frequency profile for g. When g takes only values zero and one, 
it has the form of a bandpass filter, and if the ones are restricted to a 
subsequence of indices, this nonzero interval is the passband. Lowpass 
filters restrict the passband to an initial segment and a terminal segment 
for highpass filters. 

The FFT enables fast convolution via the algorithm 

f,9 — ^ f,g — ^ fQg — ^ f*g 

where O is meant to indicate pointwise multiplication of the two vectors 
it separates. Note that the last step is accomplished via an inverse FFT, 
so that in total, the algorithm requires three FFTs and a single n point 
pointwise multiplication for a total of 0(n log n) operations. 

1.2 Group Theoretic Interpretations 

“The FFT” is actually a family of algorithms, all designed to com- 
pute efficiently the DFT (1). This linear transformation can be cast 
as a particular instance of any of a variety of mathematical operations, 
but the focus in this chapter is a group theoretic, indeed, representation 
theoretic point of view. Within this, there are at least three differ- 
ent interpretations, corresponding to either the case of finite, compact, 
or non-compact groups. We summarize these below, for each of them 
presents its own challenges for generalization. 

(1) Finite Groups — In this setting we view / as a function on the 
cyclic group of order n, Cn (isomorphic to Z/nZ). The FFT is the 
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efficient change of basis algorithm that takes a function written in 
terms of the basis of delta functions and re-expresses it in terms 
of the basis of sampled exponentials, 

I ^ {'^f(k)ek^ 

Uec„ J 



where ek{j) = 

(2) Compact Groups — In this case, the vector / is viewed as sam- 
ples of a function on the circle (i-e., samples of a periodic 
function). Any such function has a Fourier expansion, f(t) = 

where the Fourier coefficients are computed by 

an integral 

1 

f^e^TTityniU^t. 

In general, the FFT can be used to compute efficiently an approx- 
imation to these Fourier coefficients, but in the interesting ban- 
dlimited case, in which the function’s Fourier expansion is finite 
(i.e., there exists B >d such that f{i) = 0 for i > B), there is an 
exact quadrature or sampling rule that provides an exact formula 
for the (potentially) nonzero Fourier coefficients in terms of a DFT 
of length 2B + 1. 

(3) Non-compact Groups ~ In this last case, we view our discrete 
set of samples as arising from a complex-valued function / defined 
on the real line M. Once again, the Fourier transform is a linear 
transformation from time (or space) to frequency, this time given 
as the integral operator (for each x), 

Rx) = [ f{y)e-^^^y^dy. 

Jr 

As in the compact case the DFT might be used to approximate 
this integral, and once again there is a bandlimited theory (i.e., 
the case in which the Fourier transform only has finite support). 
In this case, the function / is determined by its equispaced sam- 
ples along the entire real line (i.e., so-called “Shannon sampling”). 
Consequently, the FFT provides a means for an efficient and quan- 
tifiable approximation to the computation of /’s frequency content. 

In summary, the FFT makes possible the efficient analysis of: (1) 
discrete periodic data viewed as a function on the discrete circle, that is. 
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a cyclic group of finite order {Cn)'-, (2) continuous periodic data, viewed 
as a function on the circle; and (3) continuous data, viewed as a function 
on the line. 

1.3 Noncommutative generalizations 

The groups Cm S^, and M are all commutative groups, i.e., the law we 
use to combine them obeys a commutative rule: x + y = y + x. Each of 
the above commutative group theoretic interpretations has, over the past 
generation, found generalization to the noncommutative setting, and the 
purpose of this chapter is to provide a window into this work. 

Abstractly, a group is simply a set closed under some associative multi- 
plication rule such that there is an identity element, and to each element 
there is an inverse. Classically, these arose as the symmetries of roots of 
polynomials, i.e., those arithmetic transformations that leave invariant 
a given polynomial, and from this they grew to encompass the notion 
of symmetry throughout mathematics and physics. They are in general 
noncommutative, i.e., usually xy ^ yx (think matrices!). As indicated 
above, they come in at least three general flavors - the three in which 
we are interested: Finite, Compact and Non- compact. 

(1) Finite groups — The most familiar commutative examples are the 
aforementioned cyclic groups, Cn, while of the noncommutative 
examples, the symmetric groups, Sn, the group of permutations of 
n elements, commonly realized as the group of all card shuffles of 
a deck of size n is perhaps the most familiar. 

(2) Compact groups — Standard examples come from the matrix 
groups whose entries are bounded in size. The orthogonal groups 
0{n) and the special orthogonal groups SO{n) (also called ro- 
tation groups — symmetries of the n — 1-dimensional sphere), and 
their complex analogues — unitary groups U (n) and special unitary 
groups SU{n), are examples. With their length-preserving prop- 
erties they are effectively the symmetries of space. 

(3) Non-compact groups — The invertible complex or real matrices, 
GL„(C) or CLn(S.) are well-known examples, and within these, 
the Euclidean motion groups are particularly useful. These are 

(for any n) the matrices of the form ^ ^ ^ ^ where a G SO{n) 

and b G M”. These occur naturally as symmetries of n-dimensional 
affine space. 
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1.3.1 Noncommutative DFTs. Given a complex- valued func- 
tion defined on a group G, its Fourier decomposition (analysis) is meant 
to be a rewriting in terms of a basis of functions that are nicely adapted 
to translation via group elements. It is in this sense a symmetry-guided 
decomposition. 

In the commutative case we have eigenfunctions of translation: If 
e(x) = then 

Tye{x) = e(x — y) = 

where Ty indicates the translation operator. 

In the noncommutative case there are no simultaneous eigenfunctions 
for all translation operators. This is both the source of frustration as well 
as the spur to art for the theory and application of the representation 
theory of noneommutative groups. This forces us to look for the next 
best thing which is closure of some linear space under the translation 
action. That is, a basis of functions ek{x) on the group that have the 
property 

TyOkix) = ek{xy~^) = '^Ty{k,C)ei{x). 

l 

In this way we see that the eigenfunctions are naturally replaced by 
functions that act like, and bear the name of matrix elements Ty{k,£), 
and it is essentially these functions which replace the sampled expo- 
nentials that create frequency space in the commutative case. When 
grouped together they give matrix representations of the group and com- 
prise what is called the dual of the group (denoted G). Their study is 
the subject of group representation theory. 

So in general we have, for any function / defined on a finite group G, 
the notion of a Fourier expansion 

/w = EypE f{kJ)T,{k,i) (3) 

peG 

where Cp is some constant depending on an irreducible representation p, 
the f{k, t) are the Fourier coefficients, and the matrix elements (which 
depend on x) now span the analogue of frequency space. The Fourier 
transform computes these Fourier coefficients, and it amounts to com- 
puting the discrete inner product of the function with the new basis of 
irreducible matrix elements. 

Should G be compact, the sum is infinite (in analogy with the sum 
over the integers in the case of the circle) while if G is non-compact, 
this sum is in general some sort of integral (cf. [9] for pointers to basic 
representation theory references). 
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This new basis effects convolution in a manner akin to the commuta- 
tive case: 

/ * g{k, ^) = X] rn)g{m, i) (4) 

m 

where f -k g{x) = Yly&c 

1.4 Organization of this chapter 

The majority of what follows focuses on the case of finite groups, 
for most of the progress has been in this area. This is the content of 
the next section. Included are generalizations of both the Cooley- Tukey 
FFT (decimation in time) in the guise of separation of variables group 
FFTs, as well as the Gentleman-Sande FFT (decimation in frequency). 
We also touch upon the large body of recent work devoted to the de- 
velopment of quantum (finite group) FFTs. Section three is devoted to 
compact group FFTs, almost exclusively compact Lie groups, while Sec- 
tion four discusses recent work in the difficult, but tremendously useful, 
noncompact case. Indeed, this raises the issue of both the utility and 
applicability of these algorithms, for while the abstract development of 
algorithms has epistemological value, it is even of greater interest when 
motivated by and subsequently applied to real problems. With this in 
mind, each section contains some indication and discussion of applica- 
tions, and indeed, we hope that this chapter might inspire many new 
uses. 

2. Finite group FFTs 

As mentioned, when G is finite and commutative, the number of oper- 
ations required is bounded above by 0{\G\ log |G|). For arbitrary finite 
groups G, upper bounds of 0(|G| log |G|) remain the holy grail in group 
FFT research. Implicit in the big-O notation is the idea that a family 
of groups is under consideration, with the size of the individual groups 
going to infinity. In 1978, A. Willsky provided the first noncommutative 
example by showing that certain metabelian groups had an 0(|G| log |G|) 
Fourier transform algorithm [55]. 

Two of the most important algorithms in the commutative case are 
the Cooley- Tukey FFT and the Gentleman-Sande FFT, the former of- 
ten described as decimation in time, while the latter as decimation in 
frequency, their similarity reflected in a natural isomorphism between 
the group and its dual that exists in the finite commutative case. In this 
section we describe in some detail the separation of variables approach 
[35] which generalizes the former, and an isotypic projection algorithm 
[32] which generalizes the latter. 
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2.1 Applications 

While the applications of Fourier analysis on commutative groups is 
now legion (see the Introduction in [8] for a truly mind-boggling list!), 
for finite noncommutative groups the list is still short, but constantly 
growing. 

To date, Fourier analysis on the symmetric group Sn seems to have 
found the most applicability. It has has been proposed and used to 
analyze ranked data. In this setting respondents are asked to rank a 
collection of n objects. As a result, each participant in effect chooses 
a permutation of the initially ordered list of objects. The counts of 
respondents choosing particular rankings then gives rise to a function 
on Sn for which Fourier analysis provides a natural generalization of the 
usual spectral analysis applied to a time series. Diaconis has used this 
to study voting data (cf. [14] for a discussion of this example, as well 
as others). More recently, Lafferty has applied this to the development 
of conditional probability models to analyze some partially ranked data 
[27]. 

In communications, Fourier analysis on finite matrix groups, SL 2 {p), 
the group of two-by-two matrices with determinant one with entries in 
a finite field, has made possible new developments in the area of low 
density parity check (LDPC) codes [28], and also proved instrumental in 
the construction of expander graphs that provide models for networks 
with high connectivity but relatively small numbers of links. 



2.2 Cooley-Tukey revisited 

The separation of variables approach generalizes the decimation in 
time FFT, which is essentially the guts of the Cooley-Tukey FFT. 

Assuming that n = pq (not necessarily prime), then decimation in 
time refers to the factorization of our time index i as 



i = iiq + £2 {0 < £1 < P ,0 < £2 < q) 



(5) 



which is coupled with a corresponding factorization of the frequency 
index k as 



k = k\ + k2P (0 < /ci < p, 0 < /c2 < <?) 



( 6 ) 



so that 

J{k) = 

<?2 h 



(7) 



where f{£\,£ 2 ) = f{£\q + £ 2 )- 

Notice that this rewrites a “one-dimensional” computation as a “two- 
dimensional” computation. The FFT organizes the calculation into two 
stages: 
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■ Stage 1 : For all ki , £2 compute 

= ( 8 ) 

h 

This requires at most pq^ operations. 

■ Stage 2: For all k\,k 2 compute 

f{kuk 2 ) = (9) 

t2 

This requires at most p^q operations. 

In toto, this gives an algorithm which requires pq{p + q) operations, 
rather than {pq)‘^, providing savings as long as factorization is possible. 

This approach generalizes nicely. Decimation in time is naturally 
replaced by group factorization, (first generally observed by Beth [5]), 
but the concomitant factorization of the dual (frequency) requires a little 
work. For this the machinery of Bratteli diagrams has proved to be 
of immense utility. For illustration we’ll revisit Cooley- Tukey in this 
setting. 



Bratteli diagram for Z/6Z > 2Z/6Z > 1. 



r'e C3 Cl 




Figure 1. The Bratteli diagram for Ce > C 3 > C\. 

In brief, the diagram above reflects a chain of subgroups Cg, > > 

1 , the nodes correspond to representations (frequencies) and one node 
is connected to another if when restricted to that subgroup it gives 
that corresponding representation. For example, evaluation of 65 on 
the multiples of 2 (which comprise the copy of 6*3 in Cq) is equivalent 
to simply evaluating 62 on C 3 . 

A full path in the Bratteli diagram is now a frequency, and the fac- 
torization (6) gives a labeling of the legs that make up the path. Stages 
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1 and 2 above can now be reinterpreted diagrammatically. Stage 1 re- 
quires the computation over all initial paths k\ and subgroup elements 
while Stage 2 becomes a computation over all coset representatives 
(.2 and full paths {k\, k 2 )- 

Separation of variables 

As described in [35] in the noncommutative case separation of vari- 
ables takes on a general form that requires more elaborate Bratteli dia- 
grams. Once again, the initial data is a chain of subgroups Gn >...,> 
G\ > Go = {!}, but the nodes at level i now correspond to matrix rep- 
resentations of Gj. A node r/ at level i is connected to p at level i-|- 1 by 
a number of arrows equal to the multiplicity of r/ in pjGj. 

(I) 

(3.1) 

(2.2) 

( 2 , 1 , 1 ) 

( 1 , 1 , 1 , 1 ) 

Figure 2 . The Bratteli diagram for S4 > S3 > S2 > 1 . 

For example, matrix representations for the group Sn correspond to 
partitions of n, hence the labeling in the Bratteli diagram for the sub- 
group chain ^4 > 5s > ^2 > 1 in Figure 2. Notice that the partition 
(2, 2) in Figure 2 reveals that full paths are no longer uniquely described 
by their endpoints. 

The arrows from rj to p correspond to mutually orthogonal Gj-equivar- 
iant maps of a given irreducible vector space. In this way each full path in 
the diagram corresponds to a basis vector of an irreducible representation 
of G. Bases indexed in this fashion are called Gel’fand-Tsetlin bases. 

Formally, this creates an isomorphism between the path algebra of the 
Bratteli diagram and the chain of semisimple algebras defined by the 
succession of group algebra inclusions C[Gj] C[Gi+ij. In this way the 
group algebra C[Gn] is realized as a multimatrix algebra (see e.g. [ 20 ]). 

Matrix elements are now indexed by pairs of paths with a common 
endpoint. The beauty of the Bratteli diagram formalism lies in the 
convenient characterization it gives for all types of structured matrices 
which can arise through the use of Gel’fand-Tsetlin bases. 

To begin, consider a G Gi < Gn- According to the above explanation, 
the entries of p{a) are indexed by pairs of paths from 1 to p in the 
corresponding Bratteli diagram. Since a G Gi, the matrix entry puv{o) 
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can be nonzero only when paths u and v intersect at level i, i.e., at Gi, 
and agree from level i to level n. In this case the matrix coefficient Pvw{o) 
is independent of the subpath from level i to n. This is precisely the 
diagrammatic realization of a block diagonal matrix with certain equal 
sub-blocks. It is for this reason that these are also sometimes called 
adapted bases. 

For another example, consider the situation in which a G Gn cen- 
tralizes Gj. Using the path algebra formalism, it is not too difficult to 
show that in this case Puv{o) can only be nonzero when u and v agree 
from level 0 to level j, then vary freely until they necessarily meet at p 
at level n. Here the matrix coefficient depends only upon the pairs of 
paths between levels j and n. 

Finally, any factorization of the group, say into elements in subgroups 
of the chain as well as their centralizers, then gives a factorization of 
representations into sums of products of matrix elements, which by the 
previous discussion are only nonzero in case very particular eompatibil- 
ity relations are satisfied among the corresponding sets of contributing 
paths. Complexity estimates are then computed in terms of counts of 
compatible diagrams, and also indicate a freedom of choice among a 
range of possible orders of evaluation, over which the complexity esti- 
mates may vary (see e.g. [29, 36] for the case of the symmetric group). 
The full formalism [29, 34] phrases all of this in the language of bilin- 
ear maps and bears some resemblance to the fundamental FFT work of 
Winograd [56]. 

2.2.1 State of the art. This separation of variables approach 
and its even more elaborate successors have been responsible for the 
fastest known algorithms for almost all classes of finite groups, including 
the symmetric groups [29] and their wreath products [35]. These are 
among the classes of groups for which OjG] log'll Gj Fourier transform 
algorithms are known. Other examples include the supersolvable groups 
[3] , while the algorithms for finite matrix groups and Lie groups of finite 
type still have room for improvement [33, 36]. 

2.2.2 Finite group quantum FFTs. By now there are many 
books and surveys available as introductions to quantum computing. 
Suffice it to say that the problem of computing a Fourier transform on a 
finite group in the quantum setting looks formally much like the classical 
setting. Using the usual bracket notation, over an arbitrary finite group 
G, this analogously refers to the transformation taking the state 

Y,f(z)\z) tothestate ^ 
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where / : G ^ C is a function with H/H 2 = 1 and f{co)ij denotes 
the i,jth entry of the Fourier transform at the representation to. The 
collection {|u)}t, represent a set of basis vectors for the Hilbert space in 
question. 

To date, the main applications of quantum computing, or more pre- 
cisely, the advantages attributed to quantum computing, have relied on 
the use of commutative Fourier analysis for the discovery of hidden peri- 
odicities. This is similar in spirit to the applied motivations behind the 
implementation of classical Fourier analysis, tasked to the revelation of 
the periodicities whose superposition comprise the Fourier representa- 
tion of a given time series. 

In the quantum setting “hidden periodicity” refers to the existence 
of a subgroup H in a given commutative group G such that, for a par- 
ticular function / defined on G, f is invariant under translation by the 
hidden subgroup, or equivalently, / is constant on cosets of some non- 
trivial subgroup H. For example, in Shor’s famous quantum factoring 
algorithm [51] G is the cyclic group Z* where n is the number we wish 
to factor, /(x) = r* mod n for a random r < n, H is the subgroup of 
Z* of index order (r). His quantum solution to the discrete log problem 
uses Z„ X Z„ for G. In Simon’s algorithm for the “XOR-mask” oracle 
problem [52] G is Z 2 with H given by a subgroup of order 2”“^. 

Interest in noncommutative HSPs derives from the relation to the 
elusive graph isomorphism problem: given undirected graphs A and B, 
determine if they are related by a simple permutation of the vertices 
(which preserves the connectivity relations). It would be sufficient to 
solve efficiently the HSP over the permutation group Sn in order to 
have an efficient quantum algorithm for graph automorphism (see, e.g., 
Jozsa [24] for a review). This was the impetus behind the development 
of the first noncommutative quantum FFT [4] and is, to a large degree, 
the reason that the noncommutative HSP has remained such an active 
area of quantum algorithms research. 

Most (if not all) quantum algorithms take advantage of a certain quan- 
tum parallelism by which the register (at any time a superposition of 
a collection of states — i.e., a particular vector in the Hilbert space) is 
updated via application of local unitary transformations which are, gen- 
erally speaking, the tensor product of identity matrices with unitary 
matrices of bounded size. Many of these can be applied simultaneously, 
in essence glued together to form a single quantum gate, and the full 
transform is then effected via the application of some sequence of such 
gates. The efficiency of any algorithm is then measured in terms of the 
quantum circuit depth. 
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The various sparse factorization FFTs are, in spirit, ready-made for 
quantum implementations and underly efficient quantum implementa- 
tions for [23] as well as some solvable groups [44]. A recent quantum 
adaptation of the separation of variables approach [39] provides a red- 
erivation of Beals’s original work, as well extensions to those classes of 
groups whose classical FFTs benefited from this framework. 

2.2.3 Sparse structured factorizations. In [17], sparse rep- 
resentation theoretic factorizations are put to work in helping to find 
factorizations of given linear transformations. In this work the goal is to 
describe a matrix as an element of the algebra of intertwining operators 
between two matrix representations. Having accomplished this, if the 
representations have sparse factorizations in general (e.g., of the type 
used in the separation of variables sorts of algorithms), then these can 
in turn be used to realize a sparse factorization of the original inter- 
twining element. The paper [17] discusses optimal applications of this 
approach for signal transforms such as the DFT, various DCTs (Discrete 
Cosine Transforms) and Discrete Hartley Transforms. This approach has 
been partially automated and is contained in the software library AREP 
(Abstract REPresentations) [43], which is in turn a part of the very in- 
teresting SPIRAL Project [40, 45], a multi-university effort directed at 
the automatic generation of platform-optimized software. 

2.3 Projection-based generalizations of the FFT 

The approaches explained above rely on what is commonly known as 
decimation in time, a recursive (or, depending on your point of view, 
iterative) traversal of the spatial (i.e., group) domain. Decimation in 
time often goes by the name of subsampling. 

An alternative, or perhaps more precisely, dual formulation is to in- 
stead recurse through the range, iteratively constructing the frequency 
content of the original data through successive projections which build 
out increasingly finer orthogonal decompositions. This is the philosophy 
behind decimation in frequency, originally due to Gentleman and Sande 
[19]. 

This idea has also found generalization in the context of computing 
isotypic decompositions of a function defined on a group or its homoge- 
neous space. This generalization hinges on the observation that through 
a judicious choice of group elements and their representing matrices it 
can be possible to find a collection of projection operators whose ap- 
plication can be scheduled in such a way so as to effect the requisite 
projections. 
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2.3.1 Nested projections: The Gentleman-Sande FFT. A 

DFT of length n effects the projection of the data / onto the n dis- 
tinct eigenvectors of the DFT operator given by the sampled exponen- 
tials. Equivalently, it is also the projection onto the eigenvectors of the 
cyclic shift operator acting on n-space via (j) = f{j + 1), 

where j + 1 indicates that the index is to be interpreted (mod n) . The 
DFT eigenvectors are precisely the basis which diagonalizes the shift 
operator — i.e., they are also the eigenvectors for 

Of course, the operator commutes with any of its powers. Sup- 
pose now that n = pq. Note that under the action of the 

vector space V = C” decomposes into p orthogonal and Tp^^-invariant 
( 7 -dimensional subspaces Vj = span{i5j, (5j+p, . . . , (jj_|_(g_i)p}, where 5^ de- 

(n) 

notes the standard ^th basis vector. It is clear that the action of Tf 
on any Vj is equivalent to the action of on C?, thus when restricted 
to the space Vj it is diagonalizable with eigenvectors and eigenvalues 
corresponding to the DFT of length q. 

Thus, we see that the operator Tp"'^ has only q distinct eigenvalues on 
V, one eigenspace Wj for each character of Zg, and by symmetry, each of 
these is of dimension p, and as an eigenspace is Tp""^ -invariant. Further- 
more, since Tp""^ commutes with T^\ there is a basis of simultaneous 
eigenvectors. Thus 

ITg = IT° © • • • © 

Note that the original DFT of length n is thus the projection of / onto 
the This suggests the following algorithm for computing the DFT: 

■ Stage 1: For j = 0, ...,(? — 1, compute the projection of / 
onto Wj. 

■ Stage 2: For each j and each A:, compute the projection of f^^^> 
onto W^. 

This particular fast Fourier transform is known as the Gentleman- 
Sande, or decimation in frequency., FFT (see [19]). 

2.3.2 Gentleman-Sande for finite groups. The above dis- 
cussion reveals that the decimation in frequency FFT can be viewed as 
a sequence of projections onto isotypic subspaces. In the commutative 
case these are the individual eigenspaces. For an arbitrary representa- 
tion this is the decomposition into invariant subspaces each of which 
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has an irreducible decomposition into copies of a single irreducible sub- 
space. Thus we can attempt to generalize Gentleman-Sande by finding 
a collection of operators and computing (in some order) projections onto 
their eigenspaces of a collection of simultaneously diagonalizable linear 
transformations [ 32 ]. 

For example, suppose that L{X) has three isotypic subspaces Vi, ¥2-, 
and V3. Thus L{X) = Fi © V2 © V3 and each / G L{X) may be written 
uniquely as f = fi + f 2 + f 3 where fi & Vi. Additionally, suppose that 
T and T' are diagonalizable linear transformations on L(X) such that 
the eigenspaces of T are Vi © V2 and V3, and the eigenspaces of T' are 
Vi and F2 © F3. We may therefore compute the fi by first projecting / 
onto the eigenspaces of T to compute /i + /2 and /s, and then projecting 
both /i + /2 and /s onto the eigenspaces of T' to compute /i, /2 and 
/s. Note that each computation is done with respect to a fixed basis of 
L(X). This process of decomposing L{X) = Fi © F2 © F3 is illustrated 
in Figure 3 . 



Fi © F2 © V3 



X \ 

Vl © V 2 ©3 

/ \ 1 

Vl V 2 ©3 



Figure 3 . Decomposing L{X) = Fi © F2 © F3 using T and T' . 



We call the pair {T, T'} a separating set for L{X) because it allows 
us to separate a representation into its isotypic components. 

From this point of view, the Gentleman-Sande FFT first computes 
a projection of the data onto the isotypic decomposition corresponding 
to the subgroup generated by (isomorphic to Z^) and then further 
decomposing each of these according to the decomposition of (iso- 
morphic to the fill group G). Thus is a separating set for 

Tjpq. 

More generally, suppose now that {Ti, . . . , T^} is a collection of diag- 
onalizable linear transformations on a vector space V whose eigenspaces 
are direct sums of the isotypic subspaces of V. For each isotypic sub- 
space F, let Ci = (pm, ■ ■ ■ , Rik) be the fc-tuple of eigenvalues where, for 
1 < J < Pij is the eigenvalue of Tj associated to F- If Q / Q' 
whenever F / Fq then we say that {Ti, . . . ,Tk} is a separating set for 
F. 
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The existence of a separating set {T\, . . . ,Tfc} for V means that the 
computation of the isotypic projections oiv can be achieved through 
a series of eigenspace projections as follows: 

Stage 1 Compute the projections of v onto each eigenspace for T\. 

Stage 2 For each i > 1, iteratively compute the projections of the pro- 
jections previously computed for Tj_i onto each of the eigenspaces 
of Ti. 

It is not difficult to see that the computed projections at stage k are 
precisely the isotypic projections of the vector v. 

We may easily find separating sets for V by looking to the conjugacy 
classes Ci,...,Ch of G. In particular, if Tj = class 

sum of Cj (with respect to p) and pij = \Cj\xi{Cj)/di, then the class 
sum Tj is a diagonalizable linear transformation on V whose eigenspaces 
are direct sums of isotypic subspaces, and pij is the eigenvalue of Tj that 
is associated to the isotypic subspace Vi ([32]). 

The complete collection of class sums forms a separating set of V. 
We may, however, be able to find much smaller separating sets than the 
complete collection of class sums. For example, the Gentleman-Sande 
FFT uses approximately log n of the n conjugacy classes (since the group 
is commutative each element forms a conjugacy class). Other specific 
examples where this gives a savings include the homogeneous spaces 
formed from distance transitive graphs and their symmetry groups as 
well as quotients of the symmetric group [32] . 

The efficiency of this approach depends on an efficient eigenspace pro- 
jection method. Since the separating sets we use consist of real symmet- 
ric matrices, in [32] Lanczos iteration is used. This is an algorithm that 
may be used to efficiently compute the eigenspace projections of a real 
symmetric matrix when, as in all of our examples, it has relatively few 
eigenspaces and when it may be applied efficiently to arbitrary vectors, 
either directly or through a given subroutine (see, e.g., [42]). Implicit in 
this iterated projection are notions of multiresolution analysis. See [18] 
for recent group theoretic interpretations of this. 

2.4 Open questions for finite group FFTs 

Other groups for which highly improved (but not 0(10] log"^ jOj)) algo- 
rithms have been discovered include the matrix groups over finite fields, 
and more generally, the Lie groups of finite type. See [37] for pointers 
to the literature. There is much work to be done finding new classes 
of groups which admit fast transforms, and improving on the above re- 
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suits. The ultimate goal is to settle or make progress on the following 
conjecture: 

Conjecture. There exist eonstants c\ and C 2 sueh that for any finite 
group G, there is a eomplete set of irredueible matrix representations 
for whieh the Fourier transform of any eomplex funetion on G may be 
eomputed in fewer than cilGI log^^ |G| sealar operations. 

Perhaps progress toward this goal will require new techniques. In- 
deed, it does seem as though the separation of variables approach has 
been pushed almost as far as it can go. One place to look, and indeed 
one of the more intriguing open questions in the development of FFT 
techniques, is for generalizations of those commutative FFT methods 
which are used for groups of prime order: Rader’s prime FFT [46] and 
the “chirp- 2 : transform” (the “chirp” here refers to radar chirp) [6, 47]. 

Both of these algorithms use an idea that rewrites the DFT (of prime 
length p) at nonzero frequencies in terms of a convolution of length p—1 
(which, since it is composite, can be computed efficiently using other 
FFT methods) while computing the DFT at the zero frequency directly. 
Rader’s prime FFT uses a generator g of Z/pZ’^, a cyclic group (under 
multiplication) of order p — 1, to write f{g~^) as 

p-2 

= /(O) + /(5“)e2"3“-'’/j’. (10) 

a=0 



The summation in (10) has the form of a convolution on Z/(p — 1)Z, of 
the sequence f'{a) = f{g‘^), with the function z{a) = 

Rabiner et al. [47] (see also [6]) make the change of variables jk = 
(j^ + — {j — kfi)/2 to obtain 



Af-l 

/(fe) = 

j-o 



This is a non-cyclic convolution of the sequence with the 

sequence , and may be performed using a cyclic convolution of 

any length M > 2N. Note that this gives an approach which rewrites 
the DFT in terms of a convolution that does not depend on N being 
prime. This method is commonly known as the ehirp-z transform. 

The discovery of noncommutative generalizations of these ideas would 
be very, very interesting. 
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3. FFTs for compact groups 

The DFT and FFT also have a natural extension to (continuous) 
compact groups as well. The terminology “discrete Fourier transform” 
derives from the fact that the algorithm was originally designed to com- 
pute the (possibly approximate) Fourier transform of a continuous signal 
from a discrete collection of sample values. 

Under the simplifying assumption of periodicity a continuous func- 
tion may be interpreted as a function on the unit circle, and compact 
commutative group, S^. Any such function / has a Fourier expansion 
defined as 

/gz 

where 

m = {f,ei)= r/(t)e2-'*df. (12) 

Jo 

If /(/) = 0 for |/| > A", then / is band-limited with band-limit N and there 
is a quadrature rule or sampling theory for /, meaning that the Fourier 
coefficients of any such / can be computed as a summation using only 
a finite set of samples. Thus, 

2N-2 / h \ 

N) = E jwxrd {wxt) 

where the factor 2 n-i should be viewed as a (constant) weight function 

with support at the equispaced points { 2 N-i }‘k=o‘^ (where the circle and 
unit interval have been identified). The FFT then efficiently computes 
these Fourier coefficients. 

A more general framework capable of encompassing all continuous 
compact groups and their quotients is easily stated: the irreducible rep- 
resentations of a compact group G are all finite-dimensional, and any 
square-integrable function / (with respect to Haar measure) has an ex- 
pansion in terms of irreducible matrix elements 

d\ 

/ = E E (14) 

AgA j^k=l 

where A is some countable set, denotes an irreducible representation 
of degree d\ < oo, and the implied convergence is in the mean. The 
Fourier coefficients {/(A)^^} are computed by integrals 

f{X)jk = dx{f,T^\) = dx f f{x)T^\{x)dx 

^ G 



(15) 




Recent Progress and Applications in Group FFTs 



245 



where dx denotes (the translation-invariant) Haar measure. 

In turn, a general FFT schema then requires a formulation of the no- 
tion of bandwidth, accompanied by a corresponding sampling theory, and 
an algorithmic component for the efficient evaluation of the quadrature, 
or FFT. 

3.1 Applications 

To date, it is the group of rotations in three space, 50(3), where 
most of the applications for FFTs on continuous, noncommutative com- 
pact groups have been found. Its representation theory is effectively 
that of the theory of spherical harmonics. One large source of applica- 
tions comes from the climate modeling community (see e.g. [53]) where 
spherical harmonics are used for spectral methods approaches to solving 
the relevant PDFs in spherical geometry. Further applications are to 
be found outside the atmosphere, as spherical harmonics expansions of 
the CMB are the source of new information about its fine scale inho- 
mogeneities which hope to provide new information about the shape of 
space and origins of the universe. 

Most recently, FFTs for 50(3), as applied to the development of fast 
convolution algorithms on 50(3) [25], have been used to develop search 
algorithms for shape databases. 

3.2 FFTs for compact groups — the work of 
Maslen 

The first FFT for a noncommutative and continuous compact group 
was the efficient spherical harmonic expansion algorithm discovered by 
J. Driscoll and D. Healy [15]. In this case, the Fourier expansion of a 
function on the 2-sphere, viewed as a function on 50(3)/50(2) (with 
50(2) identified with the rotations that leave the north pole fixed) has 
a natural notion of bandwidth given by the degree of the spherical har- 
monic. A sampling rule on the 2-sphere, equiangular in both latitude 
and longitude, gives a quadrature rule, and a function of bandwidth 
B (and Fourier coefficients) requires 0{B‘^) points. The story is 

completed with a fast algorithm log^ N) operations for N = B“^) 

that uses the three-term recurrence satisfied by the Legendre functions 
to produce a divide and conquer algorithm for its efficient evaluation. 

Some years later, this work was extended to the full compact setting 
by D. Maslen: a general notion of bandwidth consistent with the com- 
mutative and spherical notions [31], a sampling rule [30], and finally an 
FFT which also relies on three term recurrence relations satisfied by re- 
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lated orthogonal polynomial systems. What follows is a brief summary 
of this work 

There is a natural definition of band-limited in the compact case, 
encompassing those functions whose Fourier expansion has only a finite 
number of terms. The simplest version of Maslen’s theory is as follows: 
Definition 1. Let TZ denote a eomplete set of irredueible representa- 
tions of a eompaet group G. A system of band-limits on G is a deeom- 
position oflZ = sueh that 

■ [1] < oo for all b > 0 ; 

■ [2] bi < &2 implies that TZhi fL ^ 62 / 

■ [3] Uhl ® '^b2 C spanzUbi+b2- 

Let {Ub}b>o be a system of band-limits on G and f G Lf‘{G). f is 
band-limited with band-limit b if f{Tpf) = 0 for all \^Uh- 

The case G = provides the classical example. If Ub = {xj HjI ^ b} 
where Xj{z) = , then Xj ®Xk = Xj+k and the corresponding notion of 

band-limited (as per Definition 1) coincides with the usual notion. 

For a noncommutative example, consider G = SO{3). In this case the 
irreducible representations of G are indexed by the non-negative integers 
with V\ the unique irreducible representation of dimension 2A -|- 1. Let 
Ub = {La : A < 6 }. The Clebseh- Gordon relations 

j=|Ai-A 2 | 

imply that this is a system of band-limits for SO{3). When restricted to 
the quotient = SO{3)/ SO{2), band-limits are described in terms of 
the highest order spherical harmonics that appear in a given expansion. 

Maslen’s most general setting for a notion of band-limit develops 
a theory of band-limited elements for any filtered module over a fil- 
tered algebra. In the case of a connected compact Lie group G, Ug 
is defined to be the set of all matrix elements that come from repre- 
sentations whose highest weight is at most s. For the matrix groups 
SU{r -\- 1), Sp{r), SO{2r + 1), it is possible to choose a norm for which 
Ui is the span of all matrix representations with highest weight given by 
a fundamentally analytically integral dominant weight or zero. Band- 
width is thus defined in terms of lengths of factorizations in sums of 
products of such elements expressing a given matrix element [31]. 

The importance of developing the band-limited theory is that in this 
setting there exists a sampling theory or quadrature rule that allows the 
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Fourier coefficients to be computed exactly as finite sums. The following 
is the content of [30] , once the notion of bandlimit is arranged. 

Theorem 1 Let G he eompaet with a system of band-limits {TZh}b- For 
any band-limit b, there exists a finite set of points Xb C G sueh that for 
any funetion f G L‘^{G) of band-limit b, 

f(T-k) = E (17) 

xeXb 

for all A G TZb and some weight funetion w on Xb- 

Theorem 1 reduces the integrals (15) to summations, so that efficient 
algorithms can now be designed to perform the computations (17). For 
the classical groups U{n), SU{n), Sp{n), a system of band-limits is 
chosen with respect to a particular norm on the dual of the associated 
Cartan subalgebra. Such a norm || • || (assuming that it is invariant 
under taking duals, and ||a|| < ||/3|| -|- jlyjl for a occurring in <Si 'j) 
defines a notion of band-limit given by all a with norm less than a fixed b. 
The associated sampling sets XJ^ are contained in certain one-parameter 
subgroups. 

Implicit here are certain diserete speeial funetion transforms, which 
can often be reduced to certain diserete polynomial transforms 



N-l 

/(^i) = E (18) 

k=0 

where Pq, , Pn-i are a set of linearly independent polynomials with 
complex coefficients, {xq, . . . , XAr_i} are a set of N distinct complex 
points and {rco, . . . , wn-i} is a set of positive weights. The case of the 
DFT comes from choosing equispaced roots of unity for sample points, 
equal weights of one, and Pj{x) = xP Direct calculation of all the f{Pj) 
clearly requires N‘^ operations. 

If the Pj make up a family of orthogonal polynomials, then fast al- 
gorithms exist to speed the calculation. Here the idea is to use the 
three-term reeurrenee satisfied by these polynomials to create a divide- 
and-conquer algorithm which reduces transforms of degree n to sums 
of transforms of degree less than n, ultimately providing an 0(n log^ n) 
algorithm. (See [16] and references therein.) 

By using these sorts of complexity estimates, together with a sampling 
theory and a careful organization of the calculation (using the diagram- 
matic techniques explained above), Maslen is able to derive efficient 
algorithms for all the classical groups. 
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Theorem 2 ([31], Theorem 5) 

Assume n >2. 

(i) For U{n), Tx^{Tq) < 

(ii) For SU{n), Tx^iFC]) < 0(6t^*wSf/(n)+3n-2) 

(in) For Sp{n), Tx^{ni) < 0(6C^*mSp(n)+6n-6) 

where Txj^(TV() denotes the number of operations needed for the partie- 
ular sample set Xf and representations for the assoeiated group. 

3.3 Approximate techniques. 

In the bandlimited case, Maslen’s techniques are exact, in the sense 
that if computed in exact arithmetic, they yield exact answers. Of 
course, in any actual implementation, errors are introduced and the 
utility of an algorithm will depend highly on its numerical stability. 

There are also “approximate methods” , approximate in the sense that 
they guarantee a certain specified approximation to the exact answer 
that depends on the running time of the algorithm. For computing 
Fourier transforms at non-equispaced frequencies, as well as spherical 
harmonic expansions, the fast multipole method and its variants are used 
[21]. Multipole-based approaches efficiently compute these quantities 
approximately, in such a way that the running time increases by a factor 
of log(i) where e denotes the precision of the approximation. Another 
approach is via the use of quasi- elassieal frequeney estimates for the 
relevant transforms [38] . It would be interesting to generalize these sorts 
of techniques to compact groups and their quotients. 

3.4 Open question 

Maslen’s work effectively creates uniform sampling grids with con- 
comitant quadrature rules, but it may be possible that some applica- 
tions may require nonuniform grids. In the commutative case, exam- 
ples include applications in medical imaging and other forms of non- 
invasive testing. Noncommutative examples might include astrophysi- 
cal, weather, and climate data. The corresponding measurements are 
rarely equidistributed (in particular, there are many large uninhabited 
regions in which the data is never taken) and, in fact, these two variable 
expansions generally use grids which evenly sample in one direction, but 
use Legendre points in the other [41, 53]. For example, as applied to the 
analysis of the cosmic microwave background, this is meant to provide a 
sampling that is sparse at “the center,” which corresponds to avoiding 
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our own galaxy. It would seem to be of great interest to push forward 
the wealth of work done in the commutative setting (see e.g. [1] and the 
many examples therein). 

4. Noncompact groups 

Much of modern signal processing relies on the understanding and 
implementation of Fourier analysis for L^(R), i.e., the noncompact com- 
mutative group R. It is only fairly recently that noncommutative, non- 
compact examples have begun to attract significant attention. 

In this area some of the most exciting work is being done by G. 
Chirikjian and his collaborators. They have been concerned primar- 
ily with the Euclidean motion group SE{n). Recall that the motion 
groups are given as semidirect products of with the rotation groups 
SO{n), realized asn-|-lbyn-|-l matrices of the form 




where A G SO{n), v G and 0 „ denotes the all zero row vector of 
length n. This provides an algebraic mechanism for gluing together the 
group of additive translations with rotations. 

Their motivation comes from a diverse collection of applications, rang- 
ing among robotics, molecular modeling and pattern matching. Appli- 
cations to robotics come from the problem of workspace determination 
for discretely actuated manipulators. A standard example is a robot arm 
and a standard problem in motion planning is to determine the set of 
reachable configurations, as well as to plan a path to move from one 
configuration to another. The configuration of the end of the arm can 
be described with two parameters: a position in space (a vector in 
for n = 2 or 3) as well as an orientation (an element in SO{n)), i.e., an 
element of SE(n). One sort of design paradigm is to build a robot arm 
as an assembly of a sequence of basic modules, so that the arm takes on 
a worm-like or cillial form. Any single basic unit will have some finite 
discrete set of reachable states, defining a discrete probability density in 
SE{n). This is the workspace of a single unit. The workspace of the full 
arm (defined as the linked assembly of m of these basic units) is then 
given as the m-fold convolution of the fundamental workspace. This is 
called the workspace density. Applications to polymer science are anal- 
ogous, with similar modeling considerations used to describe the motion 
of a given end of a polymer (such as DNA) relative to its other end. 
These are but two examples. For details, as well as other applications 
see [9] and the many references therein. 
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Just as the classical FFT provides efficient computation of convolu- 
tions on the line or circle, so does an FFT for SE{n) allow for efficient 
convolution in this setting, replacing direct convolution by FFTs, matrix 
multiplications and inverse FFTs. 

In a collection of papers (see [9]) Chirikjian and Kyatkin create a 
computational framework for working with the representation theory of 
SE{3) acting on The matrix elements in this case are known and 
involve spherical harmonics, half-integer Bessel functions, glued together 
with Clebsch-Gordan coefficients. Explicitly, they find themselves in the 
position of having to compute (for a function /(r, R) of compact support 
on SE(3)) 



f P ,m' 

III 

JueS2 JreR3 JR&SO{3) 

e 



^ U^;^,{R)hl,^{vL)dud?rdR. 

n=—i' 



(19) 



where the /I’s and t/’s are defined in terms of generalized Legendre func- 
tions. 

Computation is now effected via a host of discretizations on M^, SO{3), 
and the dual index p, as well as some assumptions on the number 
of harmonics used to describe /. The exponent pu implies the need 
to convert from a rectangular to a polar grid and so there is also 
an interpolation (through splines) used. The complexity of the final 
separation of variables style algorithm is then given by gluing together all 
the appropriate fast algorithms (FFTs, fast interpolation, fast spherical 
harmonic expansions, fast Legendre expansions). 

The details of this analysis are found in [26]. This paper also contains 
an analogous discussion for SE{2) as well as the discrete motion groups 
defined as the semi direct product of translations (M^) with any of the 
(finite number of) finite subgroups of SO{3). 



4.1 Open questions 

To date, the techniques used here are approximate in nature and in- 
teresting open problems abound. Possibilities include the formulation 
of natural sampling (regular and irregular), band-limiting and time- 
frequency theories. The exploration of other special cases beyond the 
special Euclidean groups, such as semisimple Lie groups (see [2] for a 
beautifully written succinct survey of the Harish-Chandra theory), is 
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also intriguing. “Fast Fourier transforms on semisimple Lie groups” has 
a nice ring to it! 
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Abstract Abelian group DSP can be completely described in terms of a special 
class of signals, the characters, defined by their relationship to the trans- 
lations defined by abelian group multiplication. The first problem to be 
faced in extending classical DSP theory is to decide on what is meant by 
a translation. We have selected certain classes of nonabelian groups and 
defined translations in terms of left nonabelian group multiplications. 

The main distinction between abelian and nonabelian group DSP 
centers around the problem of character extensions. For abelian groups 
the solution of the character extension problem is simple. Every charac- 
ter of a subgroup of an abelian group A extends to a character of A. We 
will see that character extensions he at the heart of several fast Fourier 
transform (FFT) algorithms. 

The nonabelian groups presented in this work will be among the sim- 
plest generalizations of abelian groups. A complete description of the 
DSP of an abelian by abelian semidirect product will be given. For such 
a group G, there exists an abelian normal subgroup A. A basis for sig- 
nals indexed by G can be constructed by extending the characters of A 
and then by forming certain (left-) translations of these characters. The 
crucial point is that some of these characters of A cannot be extended 
to characters of G, but only to proper subgroups of G. In contrast to 
abelian group DSP expansions, expansions in nonabelian group DSP 
reflect local signal domain information over these proper subgroups. 
These expansions can be viewed as combined, local image-spectral im- 
age domain expansions in analogy to time-frequency expansions. 

This DSP leads to the definition of a certain class of group filters 
whose properties will be explored on simulated and recorded images. 
Through examples we will compare nonabelian group filters with clas- 
sical abelian group filters. The main observation is that in contrast to 
abelian group filters, nonabelian group filters can localize in the image 
domain. This advantage has been used for detecting and localizing the 
position of geometric objects in image data sets. 

The works of R. Holmes M. Karpovski and E. Trachtenberg are the 
basis of much of the research in the application of nonabelian group 
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harmonic analysis to DSP, especially in the construction of group trans- 
forms and group filters. Similar ideas in coding theory have been intro- 
duced by F. J. MacWilliams. 

During the last ten years considerable effort has taken place to extend 
the range of applicability of nonabelian group methods to the design of 
new filters and spectral analysis methodologies, as an image processing 
tool. Accompanying these application efforts, there has been exten- 
sive development of algorithms for computing Fourier transforms over 
nonabelian groups generalizing the classical FFT. 



Keywords: finite group harmonic analysis, group filters, image domain locality. 

1. Introduction: Classical Digital Signal 
Processing 

There are two disjoint themes in this work: the fundamental role 
played by translations of data in all aspects of digital signal processing 
(DSP) and the role of group structures on data indexing sets in defining 
these translations. 

The set of integers paired with addition modulo N is the simplest 
finite abelian group. The term abelian refers to the fact that addition 
modulo is a commutative binary operation. We denote this group by 
Z /N and call it the group of integers modulo N. 

Denote by C(Z/N) the space of all complex valued functions on X/N. 
C{Z/N) is the space of iV-point data sets. 

For y £ Z/N, the mapping T(y) of C{Z/N) defined by 

(T(y)/) (x) = fix -y), / G C{Z/N), x £ Z/N, 

is a linear operator of C{Z/N) called translation by y. 

For g £ £(Z/A^), the mapping C{g) of CiZ/N) defined by 

C{g) = Y. 9{y)T{y) 

yCLjN 

is a linear operator of CiZjN) called convolution by g. If / G C{Z/N), 
then 

iCig)f){x)= Y fix-y)giy), x g z/w 

y&Z/N 

1.1 Fourier Analysis 

Fourier analysis provides the tools for the detailed analysis and con- 
struction of translation-invariant subspaces of C{Z/N). 

For y £ Z/N, define Xy ^ C{Z/N) by 

X,(x)=e2-^, 



X G Z/N. 




Group Filters and Image Processing 



257 



The collection 

{x, : y G Z/N} 

is a basis of C{Z/N) called the exponential basis. 

A mapping x'- Z/N — > is called a eharaeter oi Z/N if x satisfies 

x{x + y) = x{x)x{y), X, ye Z/N. 

denotes the set of nonzero complex numbers. Direct computation 
shows that every exponential function is a character of Z/N . Eventually, 
we will show that the collection of characters and exponential functions 
olZ/N coincide. 

We want to show that the characters of Z/N can be defined in terms 
of the translations of C{Z/N). 

Suppose Ad is a collection of linear operators of a vector space V. For 
M G Ad, a nonzero vector u in E is called an M- eigen veetor if there 
exists a e C such that 

Mv = av. 

We call a the eigen value of M corresponding to the eigen vector v. 
A nonzero vector u in E is called an Ad-eigen vector if v is an M-eigen 
vector for all M G Ad. In this case, there exists a mapping a : Ad — > C 
such that 

Mv = a{M)v, M G Ad. 

A basis of E is called an Ai- eigen veetor basis if every vector in the 
basis is an Ad-eigen vector. Equivalently, a basis for E is an Ad-eigen 
vector basis if and only if the matrix of every M G Ad with respect to 
the basis is a diagonal matrix. 

Theorem 1 If x 'Is a eharaeter of Z/N , then x 'Is a T{Z/N)-eigen 
veetor. 

Theorem 2 If f is a T{Z/N)-eigen veetor and 
T{y)f = a{y)f, y G Z/N, 
then a is a eharaeter of Z/N and 

f[y) = a{-y)f{0) = a{y)*f{0), yeZ/N. 

Proof 

f{x -y) = a{y)f{x), x, ye Z/N. 



Setting y = 0, 



f{x) = a{0)f{x), X e Z/N. 
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Since / 7 ^ 0, a(0) = 1, a is a character of Z/N. Setting x = 0 and 
replacing —y by y completes the proof. 



Theorem 3 If x is a T(fL/N)-eigen vector and x(0) = 1, then x is a 
character ofZ/N. 

By Theorems 1 and 3 the characters of Z/N are uniquely characterized 
as the T(Z/iV)-eigen vectors taking the value 1 at the origin of Z/N. 

We will now complete the picture and show that the characters of 
Z/N form a T(Z/A^) -eigen vector basis. 

Linear algebra theorem If M is a commuting family of linear oper- 
ators of a vector space V and each M £ M has finite order, then there 
exists an M-eigen vector basis. 

Theorem 4 The set of characters of Z/N is a T{Z/N)-eigen vector 
basis. 

Proof Choose a T(Z/A^)-eigen vector basis 

{Xo, • • -NN-i} 

consisting of characters of Z/N. We will show that if x is character of 
Z/N, then x = Xru for some n, 0 < n < — 1. Write 

7V-1 

X = X] C({n) ec,0<n< N -1. 

n=0 

Without loss of generality, assume a(0) / 0. For y £ Z/N, since 

N-l 

T{y)x = x(-y)x = X] «(^)xn(-y)xn, 

n=0 

we have 

x{-y)a{n) = Xn{-y)a{n), 0<n<N-l. 

Setting n = 0, ioT y £ Z/N, 

x{-y) = xo{-y) 

implying X = XO; completing the proof. 



2. Abelian Group DSP 

Suppose A is an abelian group and C{A) is the space of all complex 
valued functions on A. 
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2.1 Translations 

For y G A, the mapping T{y) of C{A) defined by 

{T{y)f){x) = f{y~^x), x e A, f e C{A), 

is a linear operator of C{A) called translation by y. 

For g £ C,[A)^ the mapping C{g) of C{A) defined by 

C{g) = Y,9{y)T{y) 

y&A 

is a linear operator of C[A)^ called convolution by g. 

By definition 

f *g{x) = '^f{y)g{y~^x), x £ A. 
yeA 

Theorem 5 Convolution is a commutative algebra product on C[A). 

The algebra formed by C{A) paired with convolution product is called 
the convolution algebra over A. 

2.2 Fourier Analysis 

Suppose CA is the group algebra of A. 

A mapping x : A — > is called a character of A if x satisfies 

x{xy) = x{x)x{y), x,y £ A. 

A character of A is simply a group homomorphism from A into C ^ . 

Denote the set of all characters of A by A*. Under the identification 
between C{A) and CA, every character r of A can be viewed as a formal 
sum in CA 

r = t{x)x. 

x&A 

Multiplication of characters will always be taken in CA. 

Example 6 As elements in C(C 2 x 0-2), the characters of C 2 x C 2 are 
the formal sums 

00,0 = I + X1+X2 + X1X2, 

01.0 = 1 - + X2 - X 1 X 2 , 

00,1 = 1 + - X2 - X 1 X 2 , 

01.1 = 1 - - X2 + X 1 X 2 . 
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The mapping tq : A — > defined by to(x) = 1, x G A, is a 

character called the trivial character of A. As an element in CA, 

to = J2x. 

The importance of characters in DSP is a result of their invariance 
under left multiplication as stated in the following theorem. 

Theorem 7 If y G A and t is a character of A, then 

yr = r(y“^)r. 



Example 8 In CCn, if 

N-l 

m=0 

then 

7V-1 7V-1 

^ ^ ^ y~nr 

m=0 m=0 

Corollary 10.1 /// G CA and t is a character of A, then 

fr = /(r)r, 

where 

f(T) = 

y&A 



Theorem 7 implies every character r of A is an L(A)-eigen vector, 

L{y)T = yr = r(y“^)r, ?/ G A. 

The eigen value of the eigen vector r of L{y) is r(y“^). 

Theorem 9 A* is the unique L{A) -eigen vector basis of CA satisfying 
the condition that the value of each basis element is 1 at the identity 
element 1 of A. 

The Fourier analysis over A will be developed in terms of the proper- 
ties of the characters of A in CA. 
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t\ 



Nt, r = a, 
0, r / A. 



A nonzero element e in CA is called an idempotent if = e. Two 
idempotents ei and C 2 in CA are called orthogonal if 6162 = 6261 = 0. 
In the language of idempotent theory Theorem 10 says that the set 

-r:T£A 

is a set of pairwise orthogonal idempotents. 

Since A* is a basis of the space CA, we can write 

1 = oi{t) G C. 

tGA* 

For any A G A*, 



A = A • 1 = a(r)Ar = Na{X)X 

tGA* 



and a(A) = proving the following. 



Theorem 11 



1 = 



N 



E 

T&A* 



T. 



A set of pairwise orthogonal idempotents T in CA is called complete 
if 

i = E- 

e£l 

In the language of idempotent theory Theorems 10 and 11 say that the 
set 

' 1 



is a complete set of pairwise orthogonal idempotents. 



Corollary 10.2 



1 

N 




X = 1, 
X / 1, 



X G A. 
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By Corollary 10.1 and completeness every / G CA can be written as 

tGA* tGA* 

We call this expansion of / the Fourier expansion of /. 

fi'r) = '^f{y)T{y~^), tgA*, 

y&A 

is called the Fourier eoeffieient of / at r. 

2.3 Character Extensions I 

In this chapter we will show that every character of a direct product 
A = B X C oi abelian groups B and C can be represented as a product 
in Cvl of a character of B with a character of C. 

More generally, we will relate the characters of a subgroup of an 
abelian group to the characters of the group through a construction 
called eharaeter extensions. The generalization of this construction to 
nonabelian groups will occupy a great deal of the later chapters. A is 
an abelian group throughout this chapter. 

2.4 Direct Products 

Suppose i? is a subgroup of A and p is a character of A. The mapping 
Pb ■ B — > defined by 

Pb{x) = p{x), xeB, 

is a character of B called the restrietion of p to B. As an element in 

CA, 

PB = '^ p{x)x. 

x^B 

Theorem 12 Suppose A = B x C . If p is a eharaeter of A, then p = 
PbPc, the produet in CA of the eharaeters pB and pc of B and C defined 
by the restrietions of p. 

By Theorem 12 the character p of A = BxC is represented as a prod- 
uct in CA of a character of B with a character of C. The representation 
is unique. 

Theorem 12 easily extends to direct product factorizations of A having 
any number of factors. 

Example 13 The character 



I + Xi - X2 - X\X2 - X3 - X1X3 -I- X2X3 -I- X1X2X3 
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of C'2(xi) X C2{x2) X C2{x^) is the product 

(1 + xi)(l - X 2){1 - X3 ) 

of the characters 1 + xi, 1 — X2, 1 — X3 of C'2(xi), C'2(x2), C2(x3). 

Theorem 14 Suppose A = B x C . If t and X are eharaeters of B and 
C, then the produet p = tX in CA is a eharaeter of A. 

Proof By Theorem 7 if x = uv, u € B, v € C, then 

xp = {ut){vX) = t{u~^)X{v~^)tX = p{x~^)p, 

implying p is a character of A, completing the proof. 



2.5 Character Extensions 

Suppose B is a subgroup of A and r is a character of B. A character 
p of is called an extension of r if r is the restriction of p to 
Consider a complete system of S-coset representatives in A, 

{ys'-f<s< 5}. 



Every x ^ A can be written uniquely as x = HsZ ^ 1 < s < z £ B. 
Suppose p is an extension of r. Then 

s 

p = ^ p{x)x = p{ysz)vsz. 

x&A s=l z&B 



Since p{ysz) = p{ys)p{z) = p{ys)T{z), 

/ s \ / 



P = 



^p{ys)ys 



Z z = 



^p{ys)ys 



\s=l 



\z£B 






proving the following result. 



Theorem 15 Suppose B is a subgroup of A and t is a eharaeter of B. 
If p is a eharaeter of A extending t, then 




We can use Theorem 15 to characterize the characters extending r. 
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Theorem 16 Suppose B is a subgroup of A and t is a character of B. 
A character p of A is an extension of r if and only if 

^ 

Assume for the rest of this section that B is a subgroup of A. For a 
character t oi B denote by 

extr{A) 

the collection of all characters of A extending the character t of B. We 
will show that there always exists a character of A extending r and 
describe extr{A). A = B x C, then by Theorems 12 and 14 extr{A) 
is the set of all products in CA of the form tA, where A is a character 
of C. 

Example 17 The characters of Ci 2 {x) extending the character 

3 



T = 



E 

n=0 



yUxSn 



V = 



of Ci 2 {x^) have the form 



Pw — 



11 

E 

n=0 






W = V. 



This will be the case if and only if 

For each such w we can write the corresponding extension py^ as 

Pw = + wx + uP'x^), 

which is the factorization of py, given in Theorem 15. In this case we 
also have the direct product factorization, 

Cl 2 {x) = Ci 2 (x 3) X Ci 2 {x^) 



and can write 

Pw = x{l + + u^x^), 

where 

1 + ux'^ + 

is the character of C'i 2 (x^) defined by u = 
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3. Nonabelian Groups 

Example 18 The 3x3 cyclic shift matrix 



5s 



0 1 0 
0 0 1 
1 0 0 



and the 3x3 time-reversal matrix 



^3 



1 0 0 
0 0 1 
0 1 0 



satisfy the relations 

= * 53 "' + h, , 5 | = h = Rl RsSsRs^ = 



Denote by Vq{S 3 , R 3 ) the collection of all matrices 



{/ 3 , ^ 3 , 5 |; Rs, SsRs, SiRs}. 



By the above relations T>q{Ss, R^) is closed under matrix multiplication. 
For example 

RsSs = S^^Rs = SiRs. 

Since R 3 S 3 / S 3 R 3 , Vq{S 3 , R 3 ) is a nonabelian group of order 6. 

In general we have the following result. Assume N >3. 

Theorem 19 The N x N cyclic shift matrix Sn and the N x N time- 
reversal matrix Rn satisfy the relations 5]^ In, 0 < n < N , and 

S§ = In = Rn, RNSNRN = Sff^. 

The set of all products T> 2 n{Sn, Rn) of the form 

: 0 < n < N, 0 < j < 2} 

is a nonabelian group of order 2N. 

The group T> 2 n{Sn, Rn) is a matrix representation of the dihedral 
group of order 2N. 
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3.1 Normal Subgroups 



Abelian group constructions do not automatically extend to non- 
abelian groups. The identification of a distinguished class of subgroups, 
the normal subgroups, is one of the most significant insights in the de- 
velopment of nonabelian group theory. 

The concept of a normal subgroup of a nonabelian group G is closely 
related to a certain class of automorphisms, the inner automorphisms of 
G. We will discuss these concepts in this section. 

Suppose H and K are subgroups of G. We say that K normalizes H 
if 



yHy-^ = H, y e K. 



H IS a normal subgroup of G if G normalizes H. 



3.2 Quotient Groups 

Suppose, to begin with, that H is any subgroup of G. 

For y G G, the set 

yH = {yx : X G H} 

is called the left H-eoset in G at y. Right H-cosets can be defined in an 
analogous manner. 

Theorem 20 Two left H-eosets are either equal or disjoint. 

The collection of left H-cosets in G forms a partition of G. Denote 
this collection by G/H and call G/H the quotient spaee of left i?-cosets 
in G. 

Theorem 21 If y, z G G, then yH = zH if and only if z~^y G H. 

A set 

formed by choosing exactly one element in each left //-coset is called a 
eomplete system of left H-eoset representatives in G. 

Theorem 22 If H is a normal subgroup of G, then the produet 

(yH){zH) = yzH, y, z G G, 

is a group produet on G/H with H as identity element and 

(yH)-^ = y-^H, y G G. 

The group G/H is called the quotient group of left ff-cosets in G. 
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3.3 Semidirect Product 

Suppose H is a normal subgroup of G. We say that G splits over H 
with complement K ii G = HK and H K = {!}. IfG splits over 
H with complement K, then we say that G is the internal semidirect 
product H<^ K. The usual argument shows that G = H K ii and only 
if every x G G has a unique representation of the form x = yz, y G H, 
z G K. In general, the complement K is not uniquely determined. 

liG = H^K and K is a normal subgroup of G then [ 77 , K] = {1} and 
G is the direct product H x K. What is new in the internal semidirect 
product is the possibility that K acts nontrivially on H. 

Suppose H and K are groups and 

'if : K — > Aut{H) 

is a group homomorphism. The semidirect product H is the pair 
consisting of the cartesian product H x K and the binary operation 

{yi, zi){y2, Z2) = {yi'i>zi{y2),ziZ2 ) , yi, V2 e H, zi,z2 g k. 

The semidirect product is a group having identity (1, 1) and 

inverse 



X ^ = {^^-i{y ^),z ^), X = {y,z) G 

3.4 Examples: Semidirect product constructions 

DSP algorithms will be designed for nonabelian group indexing sets 
formed mainly by semidirect product constructions. The groups will 
have the form A<^ H, where A is an abelian group. 

Consider the ring Z/N oi integers modulo N. An m G Z/N is called 
a unit if there exists an n G Z/N such that mn = 1 in Z/N. A unit in 
Z/N is an invertible element relative to multiplication in Z/N . The unit 
group U{N) can be characterized as the set of all integers 0 < m < A" 
such that m and N are relatively prime. 

Theorem 23 The mapping 

u — > uGU{N), 

is a group isomorphism from U{N) onto Aut{Cpj{x)). 

The group isomorphism in Theorem 23 establishes a one-to-one cor- 
respondence between the subgroups of U{N) and those of Aut{C]y{x)). 
Under this identification, we can form Ci\f{x)<^ K for any subgroup K 
of U{N) by the group isomorphism throughout this work. A typical 
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point in Cn{x)<^ K is denoted by 0<m<N,uGK with 

multiplication given by 

(x™, n)(x"', u) = uu), 0 < m, n < N, u, V G K, 

where m + un is taken modulo N. Identifying Cm{x) with the normal 
subgroup Cn{x) x {1} of Cn{x) ^ K and K with the subgroup {1} x K 
of Cn(x) K, we can write 

x^u = (x™,u), 0 < m < N, u e K. (1) 

Groups of the form Cm{x)^ K will be used in many examples as 
several DSP concepts are introduced in the following chapters. Although 
the identification in (1) is unambiguous as a description of an abstract 
group, we will use the modification, 

x'^ku = (x”^,u), 0 < m < N, u ^ U{N), 

with x'^ki = (x™,u-^). 

The reason for this modification is to avoid any confusion or awkward 
notation in viewing Cat (x)^ AT as a subset of the group algebra 

C{Cn{x)4K). 

For a prime p the unit group U{p) of T^jp \s a cyclic group of order 
p — 1 under multiplication modulo p. Choosing a generator y for U{p), 
we can write the elements of U{p) as successive powers of y, 

l,y, ..., yP~^, 

with yP~^ = 1. 

A typical point in G = Cp{x) ^ U{p) can be written as 
x”^/c^, 0<m<p, 0<j<p — 1, 

with y a generator of U{p). Multiplication in G is subject to the relations 

_ 1 _ Lp-l h T — -ryh 

•Ay X ^ iXjyiAy tAy ^y ' 

Other semidirect products can be constructed replacing U{p) by any 
subgroup of U (p) . 

Suppose N = piP 2 , with pi and p 2 distinct primes. A number the- 
oretic result asserts that U{N) = U{pi) x U{p 2 )- There exist yi, y 2 G 
U (N) such that every y G U (N) can be written uniquely as 

y = 2/f y2^ 0 < < Pi - 1, 0 < j2 < P2 - 1, 
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with yf ^ = 1 = ^ and yiy 2 = y 2 yi- 

Example 24 An arbitrary element in the group Cn{x)^U{N), N = 
P 1 P 2 , can be written uniquely as 

0 < m < N, 0 < ji < Pi - 1, 0 < j 2 < P 2 - I, 

with group multiplication subject to the relations 

_ I _ UPl-l — UP2-1 

'T* ry^yi. /y> rpy‘2, 

A/^1 ijU — liy ^^1 5 ^J/2 — *4/ ^J/2 ^ ^^1 ^y‘2 — ^J/2 ^2/1 * 

Consider the abelian group Cn{x) x CN{y)- Under the usual identi- 
fications, we can write a typical point as 0 < m, n < N , subject 

to the relations = 1 = y^ and xy = yx. We begin our discussion 
of Aut{CN{x) X C]\f{y)) by extending the concept of a unit group to 
matrices. 

The set M(2, Z /N) of all 2 x 2 matrices over Z/N is a ring with respect 
to matrix addition and matrix multiplication. The arithmetic required 
for the matrix addition and multiplication is taken in ZjN . I 2 denotes 
the identity matrix in M(2, Z/N). A matrix L G M(2, Z/N) is called a 
unit if there exists L' G M{2,Z/N) such that LL' = I 2 = L'L. L' is 
called the inverse of L in M(2, Z/N) and is uniquely determined when 
it exists. If L is a unit in M(2, Z/N), we denote by L~^ the inverse of 
L. If Li and L 2 are units, then 

{LiL 2){L^^L/^) = h = {L^^L/^)LiL2 

and L 1 T 2 is a unit with inverse . The set GL{/2, Z/N) of all units 

in M(2, Z/N) is a group under matrix multiplication in M(2, Z/N). We 
call GL{2,Z/N) the unit group in M{2,Z/N). GL{2,Z/N) can be 
characterized as the set of all L G M(2, Z/N) such that det{L) G U{N). 
det{L) is defined by the same formula for complex matrices but with 
arithmetic operations taken in Z/N. 

For 

L = [ 1 ^m{2,Z/N), 

define 4 'l : Cn{x) x Cn^p) — > Cn{x) x Cn^p) by 

^Lix^y-^) = 0 < m, n < W 

is uniquely determined by its values on the generators x and y, 
4'l(x) = x^^y^^, 4'L(y) = x^^y^U 
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Theorem 25 The mapping 

LgGL{2,ZIN), 

is a group isomorphism from GL{2,Z/N) onto Aut{CN{x) x Cn{v))- 

Under the group isomorphism in Theorem 25, there is a one-to-one 
correspondence between the subgroups of GL{2, Z/N) and the subgroups 
of Aut{CN{x) X G]\f{y)) and we can form 

{Cn{x) X CN{y)) ^K, 

for any subgroup K of GL{2,Z/N). A typical point is denoted by 
(x^y'^ , L), 0<m, n<N,LGK with multiplication given by 

{x^y’^,L){x^'y^',L') = {x^” y^” , L"), 



where 

m" = m + Liw! + L 2 U, n" = n + L^m' + L^n' , L = 

and V = LLh Under the usual identifications, we can write 

x^y^L = {x^y^, L), 0 < m, n < N, L £ K. 

As before, to avoid awkward notation in the group algebra setting, 
we will use the modified identification, 

x^^y^ki = {x"^,y^,L), 0<m,n<N, Le GL{2,ZjN). 

The discussion extends to any number of cyclic group factors as long 
as each factor has the same order. 

3.5 Nonabelian Group DSP 

The new problem raised by noncommutativity is that the characters 
no longer form a basis of nonabelian group algebras. As a result, there 
exist multidimensional irreducible ideals. Throughout G is an arbitrary 
nonabelian group of order N and CG is its group algebra. 

Example 26 The dihedral group T>2n = T>2n{x, kN-i) is the set of 
products 0<m<N,0<j <2, with multiplication subject 

to the relations 

x^ = 1 = k%_i, k^-ix = x~^kN-i- 



L\ L 2 
L3 L4 
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N-l 1 

^ ^ ^ ^ O-mjX (^m,j £ C, 

m=0 j=0 



with polynomial multiplication subject to the relations given for the 
group T> 2 n- 



Example 27 Set A = Cn{x) x Cn{v), G = ^<jC' 2 (fcc), where c = 
■ 1 O' 

1 -1 ■ 

In Figure 1 we display lines indexed by G and their translations under 
multiplications by elements from G. 



Figure 1. An image and its translates in CG 4 , = 64 




In the following figures, we show the effects of replacing kc by other 
elements of GL{2, h/N) of order 2. 

Set 



The first image in Figure 2 is the same as in Figure 1. The remaining 
images are its translates by left multiplications of elements of C 2 {kb). 



Figure 2. An image and its translates in C(A^ C^ikb)), = 64 
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G 5 = yl<^ Cg^kd), where A = Gat(x) x C]\[{y), N = 3-2^ for an integer 
K>2. Set M = 2^. 



-1 M + 1 
M -1 M 



Order G 5 by 

A Akj Akj ' 

Akl Akl Akd \ ’ 

where A is ordered as before. Figure 3 is an example of translations of 
data indexed by G 5 , M = 16. Starting with the top left image as an ele- 
ment in CG 5 , the remaining 5 images are obtained by left multiplication 
by successive powers of kd- 





Example 28 Varying group structures are placed on an indexing set of 
data, and products are computed corresponding to the group algebra. 
Gi is the abelian group C 2 n{x), where Gi is indexed by the successive 
powers of x. G 2 is the dihedral group 'D 2 n(x, k]\f-i)- G 3 is constructed 
as follows: For N = 2^ , M G Z, M > 2, + if = 1 mod N; f + 1 

generates a subgroup of U{N) of order 2. Set G 3 = Cn{x) {1, kN ,, }, 

^ 1 G U{N). The product in CG 3 is governed by the relations 

= 1, = X 2'^^/cjv^^. 
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Figure 4- Group algebra products 




G 3 is indexed by 

1, X, xkjv_^j ^, . . . , x^~^kjv_^^. 

Note that G 2 and G 3 are isomorphic groups. 

For N = 64, the first two plots in Figure 4 are those of 128 points of 
data. The product in Gi is the usual cyclic convolution of two sets of 
data and displayed in the third plot. The 4th and 5th plots are those of 
group algebra products in CG 2 and CG 3 . 

For N = 128, each of the groups determines an indexing of data of 
size 256. The first plots in 5 are those of 256 points of data. The product 
in Gi is the usual cyclic convolution of two sets of data and displayed in 
the third plot. The remaining plots are those of group algebra products 
in CG 2 and CG 3 . 

Example 29 Varying group structures are placed on the indexing set 
of two-dimensional data, and products are computed corresponding to 
the group algebra. For 0 < m, n < N, place the lexicographic ordering 
on the pair {m,n). Gi is the abelian group Gn{x) x C 2 N{y) with two- 
dimensional ordering given by 

G 2 is the abelian group (C'at(x) x Gigiy)) x 6 * 2 ( 2 ;) with two-dimensional 
ordering given by 

63 = {Gn{x) X GN{y)) 4{h,kz}, z = • The ordering of 

G 3 is the two-dimensional ordering given by 

(x-,y-); (x^,y^kz). 
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Figure 5. Group algebra products 






G4 is the nonabelian group {Cm{x) x C]\f{y)) {h, ku}, u = 



0 1 
1 0 



G 



GL(2,Z/N). The ordering of G4 is the two-dimensional ordering given 
by 



{x^,y^ku). 



For N = 32, each of the groups determines indexing of data of size 
32 X 64. Images in Figure 6 are log scaled intensity plots of data of size 
32 X 64. Images in Figure 7 are convolutions of images in Figure 6 in 
CGi, CG2, CG3 and CG4. 

3.6 Fourier Analysis 

A mapping p : G — > is called a character of G if it satisfies 



p{xy) = p{x)p{y), x,yeG. 



(2) 



A character of G is simply a group homomorphism of G into C ^ . In 
group representation theory the more general concept of a character of 
a group representation of G is defined and characters as dehned in (2) 
are called one- dimensional characters. We will not use this general 
concept. 
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Figure 6. Images of size 32 x 64 






‘'ll il'' 

. 

II. .1 • 




Figure 7. Group algebra products 




Characters of nonabelian groups are more difficult to describe than 
in the abelian group case. There always exists a character, the trivial 
character of G, which takes on the value 1 at all points of G. 

Denote the collection of all characters of G hy G* . We can view a 
character p of G as a formal sum in CG, 

p = ^ p{x)x, 
x&G 

and G* as a subset of CG. 

Theorem 30 Suppose p is a character of G. If t ^ G, 

tp = pt = p{t~^)p. 



Corollary 10.3 Suppose p is a character of G. If f £ CG, then 

fp = pf = f{p)p, 
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where f{p) = ^)- 

Theorem 30 leads to an important formula for the product of two 
characters in CG. 



Theorem 31 For eharaeters p and A of G, 



pX 



Np, p = X, 

0 / A. 



3.7 Character Extensions II 

Methods will be developed for constructing characters of nonabelian 
groups having semidirect product representations. For a group G = 
H<^ K, the strategy is to begin with a character r of H, determine if 
r has an extension to a character of G and if it does, construct the 
collection of all characters of G extending r. 

An arbitrary character r of 77 does not necessarily extend to a char- 
acter oi G = H <^K. 

We assume that 77 is a normal subgroup of G throughout this paper. 
In Section 3.9, when required, we will explicitly assume that 77 splits in 
G and G = H^K. 

3.8 Group Action 

For y ^ G and / G CG, define G CG by 

f = yfy~^ = f{x)yxy~^. 

x£G 

Two characters t\ and T 2 of 77 are said to be eonjugate in G if there 
exists y G G such that T 2 = rf. Conjugacy is an equivalence relation on 
the set 77* of all characters of 77. 

Suppose r is a character of 77 and G(r) its centralizer in G. Choose 
a complete system of left G(r)-coset representatives in G, 

{ys : 1 < s < <5}. 

Every y G G can be uniquely written as 

y = VsZ, 1 < s < S, z G G(r). 

Define the conjugates r^, 1 < s < S', of r by 



Ts = VsTys , 



1 < s < S. 



( 3 ) 
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Theorem 32 If t is a character of H, the conjugates Ts, 1 < s < S , of 
T defined by (3) are distinct characters of H and the set 

{Ts ■■ I < s < S} 

is the collection of all conjugates of r in G. 

Proof Suppose y G G and write y = ygZ, 1 < s < S, z G G{t). Since 
G{t) centralizes r, 

yry~^ = ysZTz~^yf^ = ysTyf^ = Ts 

and every conjugate of r in G is of the form r*, 1 < s < 5. If = r*, 
1 < s, t < 5, then 

T = y'f^ytryfi^ys- 

This implies yf^yt G G{t) and y* = yt, completing the proof. 

By Theorem 31 we have the following corollary. 

Corollary 10.4 If 1 < s, t < S and s t, then TgTt = TtTg = 0. 

The characters fin, 0 < n < M, of Cm{x) are defined by 
fin{x^) = , 0 < m < M, u = 

If 0 < n < and m G U{M), then since 

fin”"{x) = fin{x^) = finm{x), 

the set 

{finm : m G U{M)] (4) 

is the set of conjugates of fin in GM{x)fi U{M). The product nm is 
taken modulo M. (4) is called the conjugacy class of fin in Cm^ U{M). 

3.9 Character Extension 

Throughout this section H is a normal subgroup of G and K is an 
arbitrary subgroup of G. Following Corollary 10.5, we assume G = 
H<j K. The main results on character extensions will be proved under 
this assumption. 

Suppose p is a character of K. We say that p extends to a character 
of G if there exists a character 7 of G such that p is the restriction of 7 
to K. Generally, even if il' is a normal subgroup of G, a character p of 
K will not necessarily extend to a character of G. This obstruction is a 
major difference between the abelian and the nonabelian group cases. 




278 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



The following result establishes a necessary condition for a character 
r of the normal subgroup 77 of G to have an extension to a character of 
G. For groups of the form G = H<). K, it is also a sufficient condition. 

Theorem 33 If a character t of H extends to a character of G, then 
G centralizes r. 

Theorem 34 If t is a character of H and G\ is a subgroup of G con- 
taining H such that r extends to a character of G\, then G\ C G(r). 

As a special case, we have the following corollary. 

Corollary 10.5 Ifr is a character of H and H = G{t), then r cannot 
he extended to a character of any subgroup of G strictly containing H . 

In the language introduced below, if r is a character of H such that 
H = G{t), then r is a maximal character in G. 

Assume G = H K for the rest of this chapter. A typical point x € G 
will be written uniquely as 



X = uv, u G H, V G K, 

with the understanding that u = u ■ 1 and v = 1 ■ v. 

Theorem 35 If t is a character of H centralized by K, then every 
product t\, where X is a character of K , is a character of G extending 

T. 

Proof Choose any character A of AT and form the product p = rA. We 
must show p is a character of G. Since vt = tv, v € K, we have 

xp = uvtX = (ur)(uA), x = uv, u G H,v G K. 

By Theorem 30 

xp = r(u“^)A(u“^)rA = p{x~^)p, 

and p is an L(G)-eigen vector. Since p{l) = 1, p is a character of G, 
completing the proof. 



Corollary 10.6 Ifr is a character of H, then r extends to a character 
of G if and only if K centralizes r. 

Theorem 35 describes a procedure for constructing characters of G 
extending a character r of 77 centralized by K. In fact all the characters 
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of G extending r are constructed in this way. For each character r of 
centralized by K denote by 

extr{G) 

the collection of all characters of G extending r. 

Theorem 36 If t is a character of H centralized by K, then extr{G) 
is the collection of all products tX, where X is a eharacter of K . 



The collection G* of all characters of G is the disjoint union of the 
sets 



extr{G), T £ H* and r centralized by K. 



Example 37 If p is prime, there are p — 1 characters of the group 
Cp{x) <^U{p). The only character of Cp{x) which extends to Cp(x) U(p) 
is the trivial character (po. Since U(p) has order p — 1, there are p — 1 
characters Xj, 0 < j < p — 2, of U(p). The group algebra products 

0<j<p-2, 

are the p — 1 characters of G{p)<^U (p) . 



Example 38 For N >2, the characters of the normal subgroup 



Gn{x) = {x™' : 0 < m < A^} 

of the dihedral group T> 2 n = TX 2 n{x, k]\f-i) are given by 4>n '■ Gpj{x) — > 
Eat, 0 < n < AI, 

cfn{x^) = v'^^, 0 < m < A^, u = 

Since T> 2 n = Gn{x)^ K, where K = {1, fcAr_i} and 

kN-ixkf/_;^ = 

4>n is centralized by K if and only if is real. For N odd, 

we must have n = 0 and the trivial character 4>q is the only character 
of Cn{x) centralized by K. If N is odd, T> 2 n has two characters, the 
trivial character and the character p\ defined by 

= (-1)^ 0 < m < AI, 0 < j < 2. 

For N even, we must have n = 0 or n = ^ and the trivial character (f)Q 

and the character c/>jv are the only characters of C'at(x) centralized by 
2 

K. If N is even, T> 2 n has four characters, the trivial character pQ and 
the three characters defined by 

Pl{x"^kj,_,) = (-l)^ P2{x^kj,_,) = i-ir, P3{x^kj,_,) = 

0 < m < AI, 0 < j < 2. 
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3.10 Maximal Extensions 

We continue to assume that G = H K . If a character r of H is not 
centralized by K, then r cannot be extended to a character of G. Since 
H is a subgroup of G'(r), G(r) has the form 

Gir)=H4K{T), 

where K(t) is the centralizer of r in K, 

K{t) = {v ^ K ■. VTV ~^ = r}. 

Theorem 39 If r is a character of H , then r extends to a character of 
G{t) =H4K{t). 

Denote by 

extr{G{T)) 

the collection of all characters of G'(r) extending r. 

Theorem 40 Ifr is a character of H, then extr{G{T)) is the collection 
of all products in CG of the form tX, where X is a character of K{j). 

For future use we have the following corollary. 

Corollary 10.7 If t is a character of H, then 

P = T 

p&extr(G(r)) XgK{t)* 

Suppose Gi is an arbitrary group and Ki is a subgroup of Gi. A 
character 7 of Ki is called a maximal character in G\ if 7 has no 
extension to a character of a subgroup of G\ strictly containing K\. 
Any extension of a character 7 of K\ to a maximal character in G\ is 
called a maximal extension of 7 in Gi. 

Theorem 41 If t is a character of H then extr{G{T)) is a collection 
of maximal extensions of r in G. 

We will show that the left ideal generated by a maximal character is 
irreducible. 

Suppose r is a character of H and p is a character of G(r) extending r. 
G(r) is not necessarily a normal subgroup of G. Consider the centralizer 
G(p) of p in G. Since the support of p is G(r), G(p) normalizes G(r), 

yzy ~^ eG { T ), 2; G G(r), y G G(p). 

G(r) centralizes p implying G(r) is a normal subgroup of G(p). 

Theorem 42 If t is a character of H and p is a character of G{t) 
extending t, then G{p) = G(r) . 
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3.11 Abelian by Abelian Semidirect Products 

Groups of the form G = A<^ B with A and B abelian groups are 
perhaps the simplest generalizations of abelian groups. Not surprisingly, 
the DSP of this class of groups closely resembles that of abelian groups. 

We proved that each character t oi A extends to a character of the 
centralizer G(r) = A<^ B{t) and that the set extr{G{T)) of all characters 
of G(r) extending r consists of maximal characters in G. 

3.12 EXT{G,A) 

Suppose r is a character of A. The centralizer of r in G is G(r) = 
A^ B{t). Denote by 

extr{G{T)) 

the collection of all extensions of r to characters of G{t). 

Denote by 

EXT{G, A) 

the union of the sets extr{G{T)) as r runs over all characters of A. By 
Theorem 41 every character 7 in extr{G{T)) is a maximal extension of 
r and EXT{G, A) consists of maximal characters in G. 

Example 43 If G = Gp(x) C/(p), p a prime, £'XT(G, C'p(x)) has order 
2{p — 1 ). The characters (/>„, 0 < n < p, of Gp{x) are maximal characters 
in G. The trivial character (po extends to p — 1 characters of G. 

Example 44 If G = GgCx) C/( 8 ), EXT{G,Gs{x)) has order 16. The 
characters (pi, (p^, (p^ and (pj of G%{x) are maximal characters in G. As 
elements of CG they can be written explicitly as 

(pi \ V 'iP' rP rP ^ 

1^3 1 rP V rP rP rP ^ 

(p^ 1 rP rP V vP iP '■ 

(pj 1 v‘^ iP V J.7 

The character (p 2 of G%{x) has two maximal extensions in G to char- 
acters of Cs{x)<p K 2 , K 2 = {1,5}. A similar remark can be made for 
the character (pe,. The characters (po and (pi of G^{x) extend to eight 
characters of G. 

The first result extends, to the characters in EXT{G, A), the product 
formula of Theorem 9 for characters of abelian groups. 

For / G CG set |/| equal to the order of the support of /. 
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Theorem 45 7/ 71, 72 G EXT{G,A), then 



7i72 = 



I71I71, 71 = 72, 
0, 71 / 72- 



In the language of idempotent theory a nonzero element e G CG is 
called an idempotent if = e. Two idempotents e\ and 62 in CG are 
called orthogonal if 6162 = 6261 = 0 . Theorem 45 says that 

is a set of pairwise orthogonal idempotents. Using the language of idem- 
potents, we call EXT(G, A) a set of pairwise orthogonal eharaeters in 

G. 

The next result extends Theorem 11 to EXT{G, A). 



Theorem 46 



1 = 



E 

y&EXT{G,A) 




A set 2 of pairwise orthogonal idempotents is said to be eomplete if 

i = E- 

eSX 

Theorems 45 and 46 say that the set 

is a complete set of pairwise orthogonal idempotents. Using the language 
of idempotents, we call EXT{G, A) a eomplete set of pairwise orthogonal 
eharaeters in G. 

Arguing as before, we have the following result. 



CG = 0 CG7. 

■y£EXT{G,A) 



Theorem 47 
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3.13 A Basis of CG 7 

Suppose 7 G EXT{G, A). We will show that CG 7 can be multidi- 
mensional, determine the dimension of CG 7 and construct a basis of 
CG 7 . In the next section we will show CG 7 is irreducible. 

Choose a complete system of left G( 7 )-coset representatives in G, 

{VsA <s < S}. (5) 

In general this system depends on 7 . Since G( 7 ) is a normal subgroup 
of G, by Theorem 32 

ls = ysiys^, 1 < s < 5, 

is the set of all conjugates of 7 in G and 



=0, 1 < Si, S2 < -S', Si / S2- 

Theorem 48 1/7 G EXT{G,A), then 

[Vsl : 1 < s < 5} 
is a basis of the spaee CG 7 . 



Proof Suppose x G G and write x = ysZ, 1 < s < S, z G G( 7 ). By 
Theorem 30 

X7 = ysZj = -f{z~^)ys^. 

Since the elements x'y, x G G, span CG 7 , the elements ysj, 1 < s < 5, 
span CG 7 . 

To prove linear independence, suppose 

s 

0 = ^a(s)y^ 7 , a(s) G C. 

s=i 



For any t, 1 < t < S, 



s 

0 = 7 i 0 = ^a(s) 7 i 7 sys = \'y\a{t)yt'y 

S=1 



implying 

completing the proof. 



a{t) = 0, 



1 < t < -S, 



The basis of CG 7 

{Vsl : 1 < s < 5} 

will be denoted by BAS{'y). It consists of the left multiplications of 7 
by the system (5). The basis of CG formed by the union of the sets 
BAS{'y), 7 G EXT{G, A), will be denoted by BAS{G,A). 
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3.14 Irreducibility of CGj 

Theorem 49 If u ^ EXT{G,A), then CGj is an irreducible left ideal. 

Proof We continue to assume that {r/s :l<s<5}isa complete 
system of left G( 7 )-coset representatives in G. Suppose P is a nonzero 
left ideal in CGj and f G V , f ^ 0. By Theorem 48 

s 

/ = X] 

s=i 

where a(si) ^ 0 for some 1 < si < S'. Theorems 45 and 31 imply 

s s 

= ^<^ishsiys7 = ^a{s)jsilsys = a{si)\j\jsiysi- 

S=1 S=1 

Since P is a left ideal and f G V, Jsiysi £ V and 

7 = e V 

implying V = CGy, completing the proof. 

3.15 Expansion Coefficients 

A formula will be derived for the expansion coefficients of f G CG 
relative to the basis BAS(G, A). An algorithm implementing this for- 
mula will be constructed by making explicit the relationship between 
the characters of A and EXT(G, A). 

Suppose 7 G EXT{G, A). Choose a complete system of left ^( 7 )- 
coset representatives in G, 

Vsiu), 1 < s < S(7). 

The dependence of the system on 7 is now explicitly expressed. The 
basis BAS{G,A) of CG consists of all elements of the form 

ysilh, j G EXT{G,A), 1 < s < S{j). 

The expansion of f G CG relative to this basis will be written as 

5(7) 

/ = f'ris)ysiuh- 

j&EXT{G,A) 

In this section we derive a formula for the expansion coefficients 

f^{s), jGEXT{G,A),1<s<S{j). 
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In the next section we will use this formula to derive a fast Fourier-like 
algorithm to compute these expansion coefficients. 

By Theorem 45 if / G CG and 7 G EXT{G, A), then 

SG) 

fl = ItI X] /7(«)ys(7)7- (6) 

s=i 

The expansion coefficients of the component of / in CG7 relative to 
the basis BAS{'y) are given by f'yis), 1 < s < S{'y). 

Theorem 50 If f £ CG and 7 G EXT{G, A), then 

Ms) = -A X] i<s< 5(7). 

' '' z&GG) 

Proof Set Us = UsM and S = 5(7) throughout the proof. For x £ G, 
write X = ygZ, 1 < s < S, z £ G{'y), and 

s 

/ = X] = '^ f{ysz)ysz. 

x&G s=l z&GG) 

Since by Theorem 30, ysZ^ = ^{z~^)ysl, we have 

M = J2\ ysj. 

S=l \z&GG) / 

Comparing with (6) completes the proof. 



4. Examples 



We will compute EXT{G, A) and BAS{G,A) for the groups G = 
B, where A = Cn{x) x Ci\f{y) and B = C'2(fca) x C2{kb) with 



a = 



-1 0 
0 -1 ’ 



b 



0 1 
1 0 



The fixed points in A under actions by the elements of B depend on 
whether N is even or odd. 



Fixed points 





N cvc'ii 


N odd 


k(i 

h 

ka k'E 


( 0 , 0 ), (f,o),(o,f), (f,f) 

( 0 . 0 ). ( 1 , 1 ) (./V- 1 ../V- 1 ) 

( 0 , 0 ), (l,iV-l), ..., (/V- 1 , 1 ) 


( 0 , 0 ) 

( 0 , 0 ), ( 1 , 1 ) {N - -\.N - 1 ) 

( 0 , 0 ), ( 1 ,A' - 1 ), , , ,, (A' - 1 , 1 ) 
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Apart from the difference in the first row, it is important to note that 
for N even, (y, is a fixed point for kb and kakt, but is not in the 
indexing set if N is odd. 



Centralizer in B 





character index I 


Centralizer 


N even 


N odd 


B 


(0,0), (f,f) 




(0,0) 


Bi 


(f,0), (0,f) 




None 


Bz 


lA 

A 




(1,1), 1 <l <N 


Bs 


{l,N ~l), l<l< N, / / 4 


{LN -1), l<l< N 



4.1 EXT{G^,A) 

Ext{G\, A) can be computed from the fixed points and centralizers 
and the characters of the subgroups of S. If the subgroup C of S is the 
centralizer of the point (k,l), then the collection of products in CGi, 

^ £ C* 

is the collection of all characters in EXT(G\, A) extending Tk,i- 
The characters of B are 



-^0,0 = ^ + ka + kb + kakb, (trivial character). 

Aq,1 — It ka kb kakbj 

Ai 0 — 1 ka kb kakb, 

Api = 1 - ka - kb + kakb- 

The characters of the subgroups of B are given by restriction. 



Characters of subgroups of B. 



Subgroups 


characters 


Bo 


1 


Bi 


1 ka-, 1 ka 


Bz 


1 kb, 1 — kb 


Bo 


1 kakb, 1 kakb 



The following table describes EXT(Gi, A), for even N, by describing 
the subsets extr{Gi{T)) of characters in EXT{Gi, A) extending r G A*. 
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EXT{Gi,A), N even. 



T 


extriGiG)) 


Gi(r) 


T0,0, TN N 
9. ’ 


r Ao,o, r Ao,i, T Ai,o, r Ai,i 


Gi 




t(1 -\- ka), r(l - ka) 




n,i 


r(l - 1 - kb), r(l — kb) 


A4B2 


ri,N-i 


r(l - 1 - kakb), r(l — kakb) 

1 < ; < Af, ; / f 


A4B3 



Characters r of A not listed above have i?o = { 1 } as their centralizer 
and are maximal characters in EXT{G\, A). 

EXT{Gi,A), N odd. 



T 


exU(Gi(T)) 


Gi{r) 


T0,0 

TlJ 

T^N-I 


TO.oAo.O, To^oAo,!, To,oAi,o, To,oAi,1 
Tij(l + kb), Tki(l - kb), 1<1<N 

+ kakb), - kakb), 1 < 1 < N 


Gi 

A4B2 

A4B3 



Characters t oi A not of the form given above are maximal characters 
in EXT{Gi,A). 

The basis BAS{Gi, A) is the disjoint union of the bases BAS{'j) of 
the irreducible left ideals CG17, 7 G EXT{G\, A). BAS{'y) is the set 
given by the left multiplications of 7 by the elements in the complete 
system of left i?(7)-coset representatives in B. 

We will define an ordering on BAS{Gi, A) which is compatible with 
the direct sum decomposition of CG\ into the irreducible left ideals 
CGi7, 7 G EXT{G\, A). The elements in each basis BAS{'^) of CG17 
occur as contiguous elements. 

EXT{G\, A) consists of all elements of the form 

7 = 0 < k, I < N, X e Bijkg)* . 

Order the characters of B as defined and order the characters of sub- 
groups. Order EXT{G\, A) by ordering (k,l,X), lexicographically. In 
this ordering A is the fastest running parameter, followed by I and then 
by k. 

The ordering in EXT(Gi, A) induces an ordering on the collection of 
all bases BAS{'y), 7 G EXT(Gi, A). Ordering the elements in each of 
these bases, we have ordered BAS{Gi, A) with the required compatibil- 
ity condition with respect to the direct sum decomposition. 
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BAS{j), N = 3 



5(7) 


BAS{j) 


B 


ro,oAo,o, ro,oAo,i, to,oAi,o, ro.oAi,! 


5i 


None 


52 


Tl,l(l + kb), fcaTl,l(l + kb), n,l(l - kb), fcaTl,l(l - kb), 
T2,2(1 + kb), kaT2,2{l + kb), T2,2{1 ~ kb), kaT2,2{l ~ kb). 


53 


T1,2(1 + kakb), fcoTl,2(l + kakb), n,2(l ~ kakb) , fcon,2(l - kakb) , 

T2,i(1 + kakb), kaT2,l{l + kakb), T2,l(l - kakb), fcoT2,l(l - kakb), 


5o 


ro,l, kb^Q^l, ka^Q^l, kakbTQ^I 

To, 2, khTo,2, kaT0,2, kakbTQ,2 

ri,0, kbTl,o, kaT\,o, kakbTl,0 
T2,0, kbT2,0, kaT2,0, kakbT2,0 



Figure 8 . Normalized basis of C ((Cs x Cs)^ (C2 x C2)) 



ENb 



asd 



aHa 

EfBE 

aan 

BSE 

ana 



Real part 



aaena 

Kfsa 

Imaginary part 
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Figure 9. Normalized basis of C ((C4 x C 4 )^ {C 2 x C2)) 






Imaginary part 
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Figure 10. Generators of invariant subspaces of C ((C4 x G4)^ (C 2 x G2)) 



■ m 

m 

■ iffl 

Imaginary part 



In Figure 8, the coefficients of the basis of G\, for N = 3, are displayed 
as log-scaled intensity plots, with respect to this 2-dimensional index- 
ing. The first column of 4 images corresponds to the characters of Gi, 
scaled by |Gi| = 36. Each of these elements generates a 1-dimensional 
£Gi-invariant subspace. The imaginary parts of the images in the first 
column are identically zero. The second column of 8 images, in pairs, 
corresponds to the basis of 2-dimensional invariant subspaces generated 
by the maximal extensions to B 2 , scaled by 18. The third column 
of 8 images, in pairs, corresponds to the basis of 2-dimensional invari- 
ant subspaces generated by the maximal extensions to A<^ B 3 , scaled 
by 18. The remaining two columns of images, 4 images at a time, cor- 
respond to the basis of 4-dimensional invariant subspaces generated by 
the characters of A which are maximal in G±, scaled by 9. 

An index matrix of G±, for iV = 4, is defined in an analogous way. In 
Figure 9, the coefficients of the basis of G±, for N = 4, are displayed as 
log-scaled intensity plots, with respect to this 2-dimensional indexing. 



.TTTi^SSA! 




Real part 
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An intensity scale bar is appended on the right for reference, and applies 
to all image plots in this chapter. 

The first column of 8 images corresponds to the characters of Gi, 
scaled by IGil = 64. Each of these elements generates a 1-dimensional 
£Gi-invariant subspace. The second column of 8 images, in pairs, cor- 
responds to the basis of 2-dimensional invariant subspaces generated by 
maximal extensions to A<^ B\, scaled by |A<^ili| = 32. The imagi- 
nary parts of the images in the first two columns are identically zero. 
The third column of 8 images, in pairs, corresponds to the basis of 2- 
dimensional invariant subspaces generated by the maximal extensions 
to A<^ B 2 , scaled by 32. The fourth column of 8 images, in pairs, cor- 
responds to the basis of 2-dimensional invariant subspaces generated by 
the maximal extensions to A<^ B^, scaled by 32. The remaining columns 
of images, 4 images at a time, correspond to the basis of 4-dimensional 
invariant subspaces generated by the characters of A which are maximal 
in Gi, scaled by 16. 

In Figure 10, the 28 generators of invariant subspaces are displayed. 
These are the results of adding the basis elements belonging to an in- 
variant subspace. 

5. Group Transforms 

Suppose G is an arbitrary group and A is a complete system of prim- 
itive, pairwise orthogonal idempotents in CG. The G-group transform 
or simply G-transform is the collection of right multiplications of CG 

{R{e) : e G A} . (7) 

Reference to A will always be suppressed. Each right multiplication 
in (7) is a G- component filter. Any G-component filter or sum of 
G-component filters is a G-filter. 

The G-transform of / G CG is the collection 

{R(e)/:eGA}. (8) 

Each product in (8) is a G-spectral component of /. Any G-spectral 
component or sum of G-spectral components of / is a G-filtering of /. 

For an abelian group A we always take A = A(A). Abelian group 
transforms are probably well-known to the reader. Since each filter space 
of the A-transform 

CAe, e G A(A), 
is one-dimensional, the A-spectral component 

/e = f{T)e, e = 
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can be represented by /(r). The A- spectral image of / is the collection 

|/(r) : r G yl*| . 

For any abelian by abelian semidirect product G = A<^ B we always 
take A = A(G). The filter spaces of the G-transform 

CGe, e G A(G), 

are not necessarily one-dimensional. The G-spectral image of / is the 
collection of expansion coefficients of / relative to the basis BAS{G, A). 
A G-filter is a linear operator P : CG — > CG satisfying 

P{yf) = yP{f), /gCG, yGG. 

By linearity if P is a G-filter, then 

P{gf)=gP{f), f,g€CG. 

Every / G CG defines a G-filter R{f) by 

R{f)g = gf, 5 e CG. 

R{f) is called right multiplication by /. In general L{f) is not a G-filter. 
Theorem 51 If P is a G-filter, then there exists f G CG such that 

P = R{f)- 

The image of the G-filter R{f) is the left ideal CGf. We call CG/ 
the filter space of R{f). Since we can have CG/i = CGf 2, for distinct 
/i) /2 G CG, distinct G-filters can have the same filter space. 

5.1 Matched Filtering 

Suppose G is an arbitrary group and A is a complete system of prim- 
itive, pairwise orthogonal idempotents in CG. For / G CG, the G- 
matched filter of / is the G-filter 

6/ = ^ e. (9) 

PAO 

The summation in (9) runs over all idempotents e G A such that fe 0. 
The G-matched filter of / satisfies 



/e/ = /• 
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In a typical image processing application an image / G CG is contained 
in noise and background. The problem is to separate the target / from 
the noise and background. The image data has the form 

9 = f + V, 

where rj G CG represents the noise and background. The G-matched 
filtering 

gef = f + vef 

is usually the optimal G-filtering for separating / from r/. Examples of 
G-matched filtering will be given in the following sections. 

The concept of a G-matched filter is a major tool in both abelian 
and nonabelian group filter design. As defined, the formal structure of 
a G-matched filter does not change with different choices of group G. 
However, the choice of indexing group is a critical parameter affecting 
several important characteristics of the resulting matched filters. We 
have distinguished two properties which can vary as the indexing group 
varies: invariance under left group actions and the extent and form of 
image domain locality. The first will be discussed in this section. Image 
domain locality is the topic addressed in the next section. 

For an image / G CG, the G- equivalence class of images containing 
/ is the equivalence class of images in the set 

{xf : X G G} . 

The G-equivalence class containing / depends on the actions of the left 
multiplications from G on f and can vary as G varies. 

The G-matched filter of / satisfies 

xfcf = xf, X G G. 

This implies that the G-matched filter of / is the G-matched filter of 
every image G-equivalent to /. In applications this means that the G- 
matched filter constructed for an image / will be the optimal filtering 
not only for /, but also for all G-equivalent images. 

5.2 Image Domain Locality 

Image domain locality is a key component of nonabelian group fil- 
ters not available with abelian group filters. To understand where this 
property comes from and how it can be controlled at the group level, we 
will show that the expansion coefficients in nonabelian group DSP can 
encode local image space information. 

The explanation is related to the character extension problem. If A 
is an abelian group, then this is straightforward. The basis for Fourier 
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representation is the collection of characters. Relative to this basis, each 
expansion coefficient contains image information across the image plane. 
These coefficients are sensitive to all local changes. The placing of a 
geometric structure in an image data set affects all expansion coefficients. 

The character extension problem is more complicated for an abelian 
by abelian semidirect product G = A<^ B. As we have pointed out there 
exist characters of A, which can not be extended to characters of G. The 
characters defining A(G) are maximal characters in G, but some have 
proper subgroups of G as their supports. 

Suppose e G A(G) and 



1 




where 7 is a character of the subgroup G{j) of G. For the discussion 
assume G{'y) is a proper subgroup of G. If 

{Vs ■ < s < S} 

is a complete system of left G( 7 )-coset representatives in G, then 

{Vse ■■ I < s < S} (10) 

is a basis for the irreducible left ideal CGe. The key point is that G^j) 
is a proper subgroup of G and the basis (10) is supported on pairwise 
disjoint left-cosets. For each s, 1 < s < S', the basis element is 
localized in the image domain to the left coset ysG{'y). 

Suppose / G CG. We can write 

s 

fe = J2<^is)yse. 

S = 1 

If S' G CG is supported in ytG{'j) for some 1 < t < S', then 
(/ + g)e = '^ Oi{s)yse -h a{t)yte. 

S^t 

The effect of the local image domain change on the expansion coefficients 
of the filtered image, resulting from the addition of g, is to change exactly 
one coefficient. The sensitivity to the local image domain depends on 
the relative orders of G and G( 7 ). 

For a fixed character r of A, the left ideal CGt can be written as 

CGt= 0 CGtX. 

AeB(r)* 
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^ ^ 

' ^ ^'AeB(r)*s=l 

This expansion shifts the basis in both the image and spectral image 
domains and can be viewed as a localized image-spectral image domain 
expansion in analogy to time- frequency expansions. Local image domain 
changes affect a set of prescribed coefficients and these changes describe 
the local image domain change in both image and spectral image do- 
mains. 

6. Group Filters 

In this section we study and compare properties of abelian and non- 
abelian group filters by illustrating their effects on lines and line seg- 
ments. These examples will concentrate on the image domain locality 
and the left group multiplication invariance of the filters. 

6.1 Abelian group filtering 

For N = 64, set 

Af-l 

Aq = £ CA. 

k=0 

Aq is the matched filter of the vertical line and all its abelian group 
translates. 

The following examples show detection and location by Aq of a ver- 
tical line and several of its abelian group translates. Varying levels of 
noise are added to generate the images in Figure 12. The noise is mod- 
eled as Gaussian with zero-mean. Levels of noise are indicated by the 
standard deviation (sd). Images in Figure 13 are the results of filtering 
by Aq. Even in severe noise, the location of maximum intensity yields 
the positions of the translates of a vertical line. 



Figure 11. Images containing a vertical line 
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Figure 12. Noisy images containing a vertical line 




jT t-V. jy. -: 



sd=.5 



sd=1.0 



sd=2.0 



sd=3.0 



Figure 13. Results of Filtering by 




In Figures 14 and 15 the translates of a vertical line are replaced by 
line segments of one half the length. The placement of the vertical line 
segment is given by the coordinate of the starting position. 



Figure 14 . Images containing a vertical line segment 




(4,4) (9,8) (24,51) (17,47) 



Figure 15. Noisy images containing a vertical line segment 

.V ' 

sd=.5 sd=1.0 sd=1.5 sd=2.0 

The last example shows that as the length of the line diminishes, 
the performance of the filter Aq for detecting line segments worsens. 
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Figure 16. Results of Filtering by Aq 




Figure 17. Images of vertical line segments 




(4,4) (9,8) (24,51) (17,47) 



Figure 18. Results of Filtering by Aq 




Moreover, the vertical position of the line segment cannot be detected, 
even without any noise as shown in Figure 18. 

The preceding examples show one of the major deficiencies of abelian 
group filtering, the lack of image domain locality. 



6.2 Nonabelian group filtering 

Consider the group 

G^ = A4C2{kc) 

and the group filter defined by 



Pn = 



N 



Af-l 

^ ^ R),fc T kcTQ k. 

k=l 



The choice of Pq was made for its simplicity, and because it is symmetric 
or real valued. An intensity plot of the filter is shown below for N = 32 
along with the intensity color scheme. 
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Figure 19. Intensity plot of the filter Pq, N — 32. 




The following example shows the importance of matching the filter to 
the targeted image. The first test image contains a vertical line segment, 
while the second test image contains a line segment of slope 1. In Figure 
20, the first and third images contain a vertical line segment, while the 
second and fourth images contain a line segment of slope 1. Noise levels 
are indicated by the standard deviation (sd). Figure 21 displays the 
results of applying Pq. 



Figure 20. Test images in noise 





■: - 
















-y.-- / 

■■re .7-.', ■..v.i 







sd = 2.0 sd = 2.0 sd = 5.0 sd = 5.0 



The filter Pq is matched to the vertical line and to all left multiplica- 
tions of the vertical line by the elements of G4. Since the line segment 
of slope 1 is not contained in the collection of G4-equivalent images, it 
cannot be detected by Pq. 

Invariance of Pq with respect to left multiplication by the elements 
from G4 implies that these left actions do not affect filter performance. 
The previous figure shows this for left multiplications by 0 < m, 

n < N. 
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Figure 21. Results of applying Pq 




Figure 22. 



fcc-translates of the images in Figure 20 




Figure 23. Results of filtering by Pq 




Each frame in Figure 22 displays the /cc-translate of the corresponding 
frame in Figure 20. Figure 23 displays the results of filtering the images 
in Figure 22 by Pq. 

The next pair of figures show the implications of invariance and image 
domain locality. In Figure 24 a vertical line segment of length 32 is placed 
arbitrarily, and the horizontal position is recorded. The problem is to 
determine the position using the filter Pq. Figure 25 shows the results 
of this filtering by Pq. 









300 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



Figure 24- *™-translates of an image containing line segment of length 32 




m=ll m = 5 m = 6 m = 28 



Figure 25. Results of filtering by Pq 




{u,l)= (.585, .382) (.761,. 207) (.732, .236) (.117, .851) 



The pair of numbers (tt, 1) in Figure 23 are the intensities of the upper 
and lower half of the line segments. Note the close relationship between 
the intensities of the line segment and its position. (The intensity of the 
input line segment in Figure 25 is 1.0, but this information is only rela- 
tive.) The relationship between the difference of intensities and position 
is given by 



m ~ N — 



uN 
u + I 



( 11 ) 



Accuracy of the relationship (11) depends on N, up to ^ pixels. Thus 
the smaller N is the more accurate the relationship (11). 

The same experiment is repeated with varying levels of noise. The 
position of the line segment is estimated from the filtered image using 
(11), and denoted by m' in Figure 27. 

In the presence of noise, the intensity is not uniform through the 
upper or lower half of the line segment, but the variance is very small. 
Estimate of location in (11) is derived using the intensity at position 
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Figure 26. 



^’"-translates of an image containing line segment of length 32 




Figure 21. Results of filtering by Pq 




m' = 16.1 m’ = 23.6 m' = 7.7 m' = 18.3 



(0,y) and {N,y), where y is the horizontal position of the line segment. 
A better estimate than (11) can be derived, especially in cases where 
noise characteristics are known. 

As these experiments show, detection and location of vertical lines of 
uniform intensity of known lengths can be implemented using Pq. Using 
similar analysis, a relationship can be derived and the same implemen- 
tation can be used to locate diagonal lines shown in Figure 22. Since 
kcX^y'^k~^ = the x-intercept is equivalent to the horizontal 

location of the vertical line. 

Note that the length of a line segment must be at least N for ex- 
ploiting the localization property consistently. More specifically, if the 
length of the line segment is less than Ai, in the case where the line 
segment lies entirely in lower or the upper of the two planes, we can 
localize the line segment only up to one of the two planes. This provides 
one major parameter determining the design of groups and filters in de- 
tection/localization applications: For consistent detection/localization 
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Figure 28. Intensity plot of the filter Pi, N — 32. 




of, for example, a vertical line segment of length N in an image of size 
KiN X K2N, the following two approaches can be used. 

1 Starting with a group G = (Cat x Cn)^ C2, we can apply Pq to 
subimages of size 2N x N, overlapped by at least N. 

2 Design filters associated to a group of the form G = {Cjg x Cat) ^ 

{Gki X C'a'j)- (Groups of the form {Cn^Gki) x (Cat x CK2) 
will work as well for detection/localization of vertical or horizontal 
lines, but groups of the form (Cat x {Cki x C'a'j) provide 

more flexibility in shapes.) 

The filter performance degrades as the noise level increases. This is 
particularly true with no prior knowledge of the noise characteristics. 
One method that will improve filter performance is to use a similar filter 
based on a different group of the same size. Consider the group 



G={CN{x)xCN{y))4C2{ke), 



where e 



0 1 
1 0 



Construct the G-filter 



-Pf ~ ^ ^ "^0,^ T k^TQ^k- 

n=l 

Pi is also real-valued. An intensity plot of P\ is shown in Figure 28. 

In Figures 29 - 31, results of applying Pq and Pi are shown in the 
third and fourth frames along with estimates of the location of the line 
segment. The first frame displays the line segment and the second frame 
displays the line segment embedded in background and noise. 
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Figure 29. Results of applying Po and Pi 




m = 3 sd = 2.0 m' = 5.0 m" = 5.6 



Figure 30. Results of applying Po and Pi 




m = 16 sd = 2.0 m' = 14.8 m” = 16.7 



Figure 31. Results of applying Po and Pi 




m — 4 sd — 2.0 m — 5.5 m — 2.2 



The accuracy of the estimate (11) differs between the filters Pq and Pi 
as the noise is filtered out in two distinct ways by the distinct convolu- 
tions defined by the two groups. This is another parameter determining 
design of a detection/location application. By applying several distinct 
groups, detection and location can be made more accurate. A trade-off 
is clearly the cost of processing. 
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7 . Line-like Images 

In this section the performance of groups filters on non-digital, line- 
like objects is illustrated. Generally, filter performances are better due 
to the fact that non-digital lines have more than one pixel width. 

In Figure 32 the first image is the digital line of slope 0. The second 
image is generated by rotating the horizontal line by 27° clockwise about 
the center of the image. Rotation is implemented using bilinear interpo- 
lation. The third image displays a portion of the rotated line near the 
center of rotation. 



Figure 32. Result of rotating a line 




horizontal line result of rotating detail around 
through (64, 64) the horizontal line center of rotation 



The following example compares the results of filtering a digital test 
image and a rotated test image in varying levels of noise. 



Figure 33. Test images in noise, sd = .5 




Images in Figure 33 are as follows. 

(al) digital line segment of slope — (bl) digital line in noise, 
(cl) rotated line, (dl) rotated line in noise 

The next example illustrates the performance of matched filters in 
detecting and locating a line-like object in recorded data. 
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Figure 34- Results of filtering in noise, sd = .5 




Figure 35. Sonar image of size 128 x 128 and result of filtering 




piece of data containing result of filtering 

an object of interest 



The next collection of examples are included to illustrate: 

1 the diversity of images which can be created by composition of 
parallel lines of varying orientations, 

2 the sensitivity of the matched filters to changes in orientations, 

3 design strategy for matched filters whose properties vary with im- 
age size. 

For a positive integer K and N = IQK, consider the groups 
G^ = A4Ci{ka), GT = A^C^{h), 
where A = Gn{x) x Gigiy) and 



1 


^ + 1 1 


h — 


1 

t-H 

1 


[f + 1 


0 


? ^ — 


Lf-1 0 j 



o = 
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The sum of 1-dimensional idempotents in CGq and in CG 7 define 
matched filters 



N-l 3 

Tk,{M+l)k^j £ CGq, 

k=0 j=0 
N-l 3 

Tk,{M-l)kTj £ CGy- 

k=0 j=0 

Figures 36 and 37 display the filters Pq and Pj. Unlike the matched 
filters we have previously described, properties of P% and Pj depend on 
N which determines the image size. 



Figure 36. Matched filters, N = 16. 




Pq g CGq Pj g CGy 



The first image in Figure 38 is a composition of two sets of intensities 
along parallel lines of slopes 3 and 4. The results of applying Pq and P 7 
in noise are displayed. Noise is simulated as Gaussian of zero mean with 
varying values of standard deviation. 

The last image in Figure 38 is not an algebraic object in that it belongs 
neither to CGq nor CG 7 . It is simply the sum of two intensities defined 
on an array of size 2N x 2N. 

In Figure 40, g is viewed as an element of CG 7 . The last image is 
again simply the sum of two intensities. 



Figure 37. Matched filters, N = 32. 




Pq G CGq Py G CG 7 
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Figure 38. Results of filtering in noise 



test image f f in noise 



-Pe + P? 



Figure 39. Results of filtering in noise 



Figure 40. Results of filtering in noise 



g = f + /cfef g' = g+ noise g' P q g'Pg g' P q + g' Pj 

The first image in Figure 40 is generated by adding to f its translation 
by kb. 

The results of applying Pq and Pg in noise are displayed. 



Figure 41. Results of filtering in noise, N = 32 



6 +gP? 
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Abstract Geometric algebra has been proved to be a powerful mathematical lan- 
guage for robot vision. We give an overview of some research results 
in two different areas of robot vision. These are signal theory in the 
multidimensional case and knowledge based neural computing. In both 
areas a considerable extension of the modeling has been achieved by 
applying geometric algebra. 

Keywords: Clifford algebra, Clifford spinor neuron, geometric algebra, hypersphere 
neuron, monogenic function, neural computing, Poisson scale-space, 
robot vision, signal theory. 

1. Introduction 

In this tutorial paper we present a survey of some results contributed 
by the Kiel Cognitive Systems Group to the applications of geometric 
algebra in robot vision. Geometric algebra makes available a tremen- 
dous extension of modeling capabilities in comparison to the classical 
framework of vector algebra. It is our experience that application of 
geometric algebra should be strictly controlled by the geometric nature 
of the problem being considered. To demonstrate that tight relation 
between the problem at hand, its algebraic formulation and the way to 
find solutions will be our principle matter of concern. We i will do that 
by considering some key problems in robot vision. More details of the 
results reported here can be found in the papers and reports of the Kiel 
Cognitive Systems Group at our website (http:/ /www. ks. informatik. uni- 
kiel.de). In addition, an extended version of this paper will be available 
[Sommer, 2003]. 
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1.1 Robot Vision 

Robot vision is a demanding engineering discipline emerging from 
several contributing scientific disciplines. It aims at designing mobile 
technical systems which are able to take actions in its environment by 
using visual sensory information. 

Although a matter of research for three decades, we are far away 
from having available seeing robots which are able to act comparable 
to humans in real world conditions. There are several reasons. First, 
many different disciplines such as image processing and signal theory, 
pattern recognition including learning theory, robotics, computer vision, 
and computing science are required for robot vision. Each of these have 
their own problems caused by limited modeling capabilities. Second, 
each of them has been developed to a great extent, isolated from the 
other ones, by using quite different mathematical languages of model- 
ing. Thus, the fusion of all these disciplines within one framework is 
demanding by itself. Third, the most difficult problem is the design of 
the cognitive architecture. This concerns e.g. the gathering and use 
of world knowledge, controlling the interplay of perception and action, 
the representation of equivalence classes, invariants, and conceptions. 
Besides, such a system has to cope with hard real-time conditions. 

The design of perception-action cycles cooperating and competing for 
solving a task is a demanding challenge. Of special interest is to enable 
the system to learn the required competence [Pauli, 2001] from experi- 
ence. From a mathematical point of view the equivalence of visual and 
motor categories is remarkable. Both are mutually supporting [Sommer, 
1997]. Of practical importance are the following two projections of a 
perception-action cycle. “Vision for action” means to control actions 
by vision and “action for vision” means the control of gaze for making 
vision easier. 

1.2 Motivations of Geometric Algebra 

Representing geometry in a general sense is a key problem of system 
design. But only those geometric aspects have to be represented which 
are of pragmatic importance. This opportunistic minimalism is tightly 
related to the so-called stratification of space as introduced into com- 
puter vision by Faugeras [Faugeras, 1995]. The system should be able to 
purposively switch from, e.g., metric to projective or kinematic aspects. 
We come back to that point in section 4 of [Sommer, 2003]. 

Another interesting topic is to ensure pragmatic completeness of the 
representations. But signal theory which supports image processing op- 
erations fails in that respect [Sommer, 1992, Zetzsche and Barth, 1990]. 
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Local operations have to represent intrinsically multi-dimensional struc- 
ture. The search of solutions for that problem was our original motiva- 
tion for considering geometric algebra as a modeling language. We will 
survey our efforts in striving for a linear theory of intrinsically multi- 
dimensional signals in section 2. 

Our aim of representing geometry in a general sense means thinking 
in a Kleinian sense, thus taking advantage of the tight relationship be- 
tween geometry and algebra. All we have done in our work so far is 
based on choosing a useful geometry by embedding the problem into a 
certain geometric algebra. This is in fact a knowledge based approach to 
system design. In section 3 we will demonstrate this way of modeling in 
the context of neural computing. There we will profit from the chosen 
embedding because it results in a transformation of a non-linear problem 
to a linear one. 

The problem of converting non-linear problems in Euclidean vector 
space to linear ones by embedding into a certain geometric algebra is 
related to another basic phenomenon which makes geometric algebra so 
useful in robot vision and beyond. From a geometrical point of view 
points are the basic geometric entities of a Euclidean vector space. In- 
stead, a cognitive system is operating on geometric objects as a whole 
unique entity, e.g. a tea pot. An algebraic framework is wanted in which 
any object concept and transformations of it may be represented in a 
linear manner. Regrettably, this is an illusion. But geometric algebra 
enables the extension from point concepts to rather complex ones. In 
[Sommer, 2003] we demonstrate their linear construction and how to 
model their motion in a linear manner. 

We will abstain from presenting a bird’s eye view of geometric algebra in 
this contribution. Instead, we recommend the following introduction to 
geometric algebra [Hestenes et ah, 2001]. In this paper we will use sev- 
eral geometric algebras as well as linear vector algebra. We will mostly 
write the product in the chosen algebra simply by juxtaposition of its 
factors. In some cases we will use a special notation for the chosen ge- 
ometric product to emphasize its difference to a scalar product. Special 
products will be noted specially. We will use the notation for the 

geometric algebra derived from the vector space with p + q + r = n. 

These indices mark the signature of the vector space. Hence, {p, q, r) 
means we have p/q/r basis vectors which square to 1/0. If possi- 

ble we reduce the set of signature indices, as in the case of the Euclidean 
vector space 'SA. 
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2. Local Analysis of Multi-dimensional Signals 

Image analysis is a fundamental part of robot vision. We do not 
understand image analysis as interpreting the whole visual data with 
respect to the complete scene or recognizing certain objects mapped 
to an image. This in fact is subject of computer vision. Instead, the 
aim of image analysis is to derive features from visual data for further 
processing and analysis. Its theoretical basis in called signal theory. 

In (linear) signal theory we find the framework for handling linear 
shift invariant operators (LSI operators) and spectral representations of 
both signals and operators which are computed by Fourier transform. 
Both are tightly related. Although both are widely used in image anal- 
ysis, there is a serious problem with respect of supporting recognition of 
intrinsically multi-dimensional and especially two-dimensional structure. 
This is a problem of incomplete representation. 

We have to distinguish between two different conceptions of dimen- 
sionality of image data. The first one is the embedding dimension of data 
which is two in case of images and three in case of image sequences. The 
other one is the intrinsic dimension of local image structures. It expresses 
the number of degrees of freedom necessary to describe local structure. 
There exists structure of intrinsic dimensions zero (iOD) which is a con- 
stant signal, one (ilD) which are lines and edges and two (i2D) which 
are all the other possible patterns. As the meaning of “local” is depend- 
ing on the considered scale, the intrinsic dimension at a chosen image 
position is scale-dependent too. 

In this section we will show that by embedding image analysis into 
geometric algebra the mentioned representation gap hopefully will be 
closed. 

2.1 Local Spectral Representations 

Image analysis in robot vision means to derive locally structural hints 
by filtering. From the filter outputs certain useful features can be com- 
puted. A well known but naive way of filtering is template matching. 
Template matching filters are detectors for image structure. Because 
there is no way of designing templates for each possible structure, this 
method will get stuck soon if i2D patterns are of interest [Rohr, 1992]. 
Locally the only ilD patterns are lines, edges and textures constructed 
from these. 

What filter would enable local image analysis if it is no template of 
the pattern in search? The alternative is to use a set of generic features 
as basis functions of patterns than the patterns themselves. We call this 
approach the split of identity. 
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There are many different approaches for computing the split of iden- 
tity, including multi-scale analysis, local principal component analysis 
and variants of local Taylor series development. All of these take into 
consideration only the magnitude of the filters in the Fourier domain. 

Our preferred approach in the split of identity is to instead use quadra- 
ture filters hq [Granlund and Knutsson, 1995], 

hq — he -|- jhoi ( 1 ) 

that is complex valued filters, where {he,ho} is a pair of real filters 
with even and odd symmetry, respectively. An example is the widely 
used Gabor filter [Granlund and Knutsson, 1995]. This kind of filtering 
applies evenness and oddness as a feature basis for image analysis. 

Because these filters are in quadrature phase relation, that is, their 
phase is shifted by — {he,ho} is called a quadrature pair. Then con- 
volution of the image / with such a filter ha, a G {e, o}, 

ga{x) = {ha* f){x) (2) 

results in outputs ge and go, respectively, which represent locally the 
even and odd symmetry of /. If hq is a bandpass filter, then equation (2) 
enables the above mentioned multi-resolution analysis of local symmetry. 

We restrict ourselves for the moment to ID signals because for these 
the theory is well established. 

Global even and odd symmetry is intrinsic to the Fourier representa- 
tion of a real ID signal. This property is called Hermite symmetry. But 
filtering a real function / with a quadrature filter hq results in a complex 
valued function 



g{x) = ge{x) + jgo{x) (3) 

no longer having Hermite symmetry in the Fourier domain. Instead, the 
Fourier transform G{x) has the important property of approximating 
the analytic signal Fa{u), 

Fa{u) = F{u) + jFniu) = (1 -h sign{u))F{u) (4) 

very well within the passband of the quadrature filter Hq{u). The ana- 
lytic signal in the Fourier domain is composed of the Fourier transform 
of the real signal F{u) and the Hilbert transformed signal Fh{u), 

Fh{u) = Hj{u)F{u) = -j sign{u)F{u). (5) 



In the spatial domain the analytic signal is 

fA{x) = f{x)+jfH{x) 



( 6 ) 
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which is not only a complex function as in equation (4) but also has 
the property that / and fn are phase shifted by — The functions / 
and fn form a Hilbert pair. This is caused by the fact that the Hilbert 
transform Hj changes even signals to odd ones and vice versa. There 
are no amplitude changes because |H/| = 1 for all frequencies. Hj, being 
pure imaginary, causes only phase effects. The operator of the Hilbert 
transform in the spatial domain reads 



hi{x) = 



7TX 



( 7 ) 



The importance of the analytic signal results from computing the local 
energy e(x) and the local phase </>(x), 



e(a:) = /^(x) + /|r(x), 



(8a) 



(/>(x) = arg /a(x). (8b) 

Their use in local signal analysis decomposes the split of identity into 
quantitative and qualitative subtasks. If there exists a threshold e of 
significance, then e(x) > e indicates a certain local variance of data. In 
that case it is interesting to ask for the quality of the local structure. 
This is given by the local phase in the following way. An edge (odd 
symmetry) is indicated by </>(x) = if and a line (even symmetry) is 
indicated by 4>{x) = 0 or 0(x) = vr. The same ideas can be used to 
interpret the output of a quadrature filter ^(x), given in equation (3). 

Regrettably, this method of symmetry related split of identity which 
we outlined so far is not easy extendable to 2D functions, although it 
is used in image analysis for edge and line detection. In 2D signals 
it is limited to intrinsically one-dimensional functions. Hence, the aim 
is twofold: extension in a consistent manner to a 2D embedding and 
extension to intrinsically two-dimensional signals. We will present our 
first trials in the next subsection. These results are based on the PhD 
thesis of Biilow [Biilow, 1999]. 

2.2 Quaternionic Embedding of Line Symmetry 

In this section we will first show that the algebraic nature of the 
Fourier transform in the complex domain causes a limited representation 
of symmetries for 2D functions. We propose embedding the Fourier 
transform into an algebraic domain with more degrees of freedom. Then 
we will use this idea to generalize the concept of the analytic signal for 
embedding dimension two to cover also the case of i2D signals. 
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2.2.1 Quaternionic Fourier Transform. We use the well- 
known technique of decomposing a ID function f{x) for any location x 
into an even part, fe(x), and an odd part, fo{x), according to f{x) = 
fe{x) + fo{x)r. Because the Fourier transform is a linear integral trans- 
form it maps this decomposition in an integral manner into the frequency 
domain. Let F^(u) be the complex valued Fourier transform, F^{u) £ C, 
of a one-dimensional real function f{x). Then 

F^{u) = Fr{u) + jFj{u) = F,{u) + Fo{u) (9) 

as a result of all possible local decompositions f{x) = fe{x) -|- fo{x)- 
Thus, the complex Fourier transform makes explicit the only symmetry 
concept of a ID function. In case of 2D functions f{x), x = {x,y), the 
complex Fourier transform 

F''(u) = Fr{u) +jFj{u) , u = (u,v) (10) 

can also represent only two symmetry concepts. This contradicts the fact 
that in principle in two dimensions the number of different symmetries 
is infinite. 

The 2D harmonic function 

Q'^{u,x) = exp(— j27ru-a:) (11) 

is the basis function of the 2D Fourier transform on a unit area 

= jj f{x) Q'^{u,x) dx. (12) 

The ID structure of Q^{u, x) is the reason that in the complex Fourier 
domain there is no access to 2D symmetries. Looking at the decompo- 
sition 

Q^{u,x) = Q'^{u,x) Q^{v,y) = exp{-j2TTUx) exp(-j27ruy), (13) 

it is obvious that the basis functions represent a symmetry decomposi- 
tion according to 

F^{u) = Fee{u) -h Foo{u) Foe{u) Feo{u) (14) 

and, thus, support in the spatial domain a decomposition 

f{x) = feeix) + foo{x) + foe{x) + feo{x). (15) 

That is, the algebraic nature of equation (13) corresponds to considering 
products of symmetries with respect to the coordinate axes. We call 
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this line symmetry. But the limited degrees of freedom in the algebraic 
structure of complex numbers results in a partial cover of symmetry with 

Fr{u) = Fee{u) + Foo{u), Fi{u) = Feo{u) + Foe{u). (16) 

Hence, although all symmetries according to equation (14) are contained 
in the global spectral representation of the complex Fourier transform, 
these are hidden because of equation (16) and cannot be made explicit. 
Obviously this is a consequence of the complex algebra which causes the 
equivalence of the basis functions according to (11) and (13), respec- 
tively. With respect to local spectral representation this results in the 
incompleteness as discussed in subsection 2.1. 

In [Zetzsche and Barth, 1990] Zetzsche and Barth argue from a dif- 
fential geometry viewpoint that the responses of LSI filters should be 
non-linearly combined to represent i2D structures. They developed in 
the sequel a non-linear filter approach based on second order Volterra 
series [Krieger and Zetzsche, 1996]. 

Instead, our approach is an algebraic extension of the degrees of free- 
dom of a multi-dimensional Fourier transform by embedding the spectral 
domain into a domain of Clifford numbers. In the case of embedding 
a signal of dimension N in the spatial domain, the dimension of the 
algebra has to be 2'^ [Biilow and Sommer, 2001]. We call this kind 
of Fourier transform a Clifford Fourier transform (CFT). The Clifford 
Fourier transform has been modeled already by Brackx et al. in 1982 
[Brackx et ah, 1982]. 

In the case of signal dimension N = 2, the following isomorphisms 
exist: Mo ,2 — — H. Hence, for that case the quaternionic Fourier 

transform (QFT), F^{u) G H, 

F"^{u) = JJ Ql{u,x) f{x) Q]{v,y) dx (17) 

with the quaternion valued basis functions 

Qj{u,x) = exp{-i2TTux) , Qj(u, y) = exp(-j27ru2/) (18) 

represents the symmetries of equation (14) totally uncovered in the 
quaternionic domain. The algebraic decomposition of F‘^ is given by 

F^{u) = Fl{u) + iFf{u) + jFj{u) + feF|.(ri). (19) 

For real f{x) this corresponds to the symmetry decomposition (same 
order as in (19)) 

F^(ri) = Fee{u) -h Foe{u) + Feo{u) + Foo{u). (20) 
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Q'^{u,x) = exp{—i2TTUx) exp(— j27ruy) (21) 

cannot be fused as in equation (11). Those basis functions which are not 
positioned along the coordinate axes are indeed intrinsically 2D struc- 
tures and, thus, are able to represent explicitly i2D signals. 

The importance of Fourier phase as a carrier of geometric information 
is known, as well as the lack of reasonable phase concepts for 2D signals. 
The presented QFT overcomes this problem for the first time because of 
the representation 

F^{u) = \F^{u) \ exjp{i(j){u)) exp{kip{u)) exp{j9{u)) (22) 

of the QFT. Here the triple 
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of the quaternionic phase represents the ID phases in axes directions 
{ 41 , 6 ) and the 2D phase respectively. 



2.2.2 Quaternionic Analytic Signal. There are different ap- 
proaches, with limited success, to generalizing the analytic signal in 
the complex domain for representation of i2D structure. This in effect 
means generalization of the Hilbert transform to the multi-dimensional 
case [Hahn, 1996]. The quaternionic analytic signal (QAS) presented in 
[Biilow and Sommer, 2001] takes advantage of the additional degrees of 
freedom in the quaternionic domain. In the spectral domain it is 

F|(u) = (1 -h sign(u))(l -h sign(u))F^(u) (23) 



and in the spatial domain 

/l(a^) = fix) + n^fnix), (24) 

where n = is the vector of quaternionic imaginary units and 

f\j is the vector of the Hilbert transformed signal. 



fnix) 




S{x) 
Try ’ 



— V. 

TT^rcy J 



(25) 



Regrettably, there is a mismatch of the chosen symmetry concept in 
the case of an isotropic pattern. Therefore, in the next subsection we 
demonstrate another way of modeling a generalization of the analytic 
signal. 
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2.3 Monogenic Embedding of Point Symmetry 

The drawback of the QAS is the rigid coupling of the considered sym- 
metry concept to the coordinate axes of the image signal. In case of 
deviations of the patterns from that model, e.g. in case of a rotated 
pattern, the quaternionic signal results in wrong local spectral repre- 
sentations. The alternative approach of generalizing the analytic signal 
which is outlined in this subsection has been developed in the PhD thesis 
of Felsberg [Felsberg, 2002]. 

The aim of this approach is to generalize the analytic signal in a rota- 
tion invariant manner. While in [Biilow et ah, 2000] the identification of 
the isotropic multi-dimensional generalization of the Hilbert transform 
with the Riesz transform was presented for the first time, in [Felsberg 
and Sommer, 2000] the relation of the corresponding multi-dimensional 
generalization of the analytic signal to the monogenic functions of Clif- 
ford analysis [Stein and Weiss, 1971] has been published. 

As in 2D (complex domain), in 4D (quaternion domain) no rotation 
invariant generalization of the Hilbert transform could be formulated. 
This is different for embedding a 2D function into a 3D space, or more 
generally an nD function into an (n-|-l)D space. The Riesz transform 
then generalizes the Hilbert transform for n > 1. We will give a short 
sketch of its derivation in the 2D case in the framework of Clifford anal- 
ysis, which is an extension of the complex analysis of ID functions to 
higher dimensions. Furthermore, we will discuss the extension of the 
analytic signal to the monogenic signal. The monogenic signal will find 
a natural embedding into a new scale space which is derived from the 
Poisson kernel. All these results are complete representations of ilD 
signals in 2D. We finally will present a first approach of an extension to 
i2D signals. 

2.3.1 Solutions of the 3D Laplace Equation. Complex anal- 
ysis is mainly concerned with analytic or holomorphic functions. Such 
functions can be obtained by computing the holomorphic extension of 
a real ID function. The resulting unique complex function over the 
complex plane fulfills the Cauchy-Riemann equations. There exists a 
mapping of a holomorphic function to a gradient field which fulfills the 
Dirac equation. Such a gradient field of a holomorphic function has zero 
divergence and zero curl everywhere and is, thus, the gradient field of a 
harmonic potential. The harmonic potential in turn fulfills the Laplace 
equation. What is known in signal theory as the Hilbert transform and 
analytic signal, respectively, corresponds to the ID part of the mentioned 
holomorphic extension in the complex plane. For a real 2D function the 
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corresponding framework is given in Clifford harmonic analysis [Stein 
and Weiss, 1971, Brackx et ah, 1982], 

Let a; G x = xe\ A- ye 2 + ze^, be a vector of the blades (1^3)1 
of the Euclidean geometric algebra M3. Then a M3-valued analysis can 
be formulated, see also [Felsberg, 2002], for 3D vector fields g{x),g = 
giBi + 52^2 + 5363, which fulfill the 3D Dirac equation. 



Vsgix) = 0. (26) 

Here V3 is the 3D nabla operator defined in M3. Then g{x) is called a 
monogenic function and is a generalization of a holomorphic one. It can 
be shown that there exists a scalar valued function p{x) called harmonic 
potential which is related to g{x) by g{x) = V^p^x). Therefore, g{x) 
is a gradient field called the harmonic potential field. The harmonic 
potential p{x) fulfills the 3D Laplace equation, 

A^p{x) = V^Vspix) = 0, (27) 



with A3 as the 3D Laplace operator. If g{x) is harmonic, its three 
components gi{x) represent a triple of harmonic conjugates. This is 
the important property we want to have to generalize a Hilbert pair of 
functions, see subsection 2.1. 

The fundamental solution of the Laplace equation (27) is given by 



and its derivative. 



p{x) 



1 

27r|a; 



9{x) 



X 

27r|a;P 



(28a) 



(28b) 



is the harmonic field we started with. 

Before having a closer look at the fundamental solution (28) of the 
Laplace equation and its derivative, we will relate it to the monogenic 
extension f^{x) of a real 2D function f{x,y) in M3. The starting point 
is a usefull embedding of /(x, y) into M3 as a vector field, respectively 
as an es-valued function. 



f{x) = f{xei + ye 2 ) = f{x, y)c3. (29) 

Note that z = 0. Then the monogenic extension f^{x) of the real 2D 
function /(x, y) can be computed by solving a boundary value problem 
in such a way that is monogenic for z > 0 and its e3-component 
corresponds to f{x, y) for z = 0. 

The boundary value problem reads 

Asp{x) = 0 for z > 0 



(30a) 
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e 3 -^p{x) = f{x,y)e 3 for z = 0. (30b) 

Solving equation (30a) results in the fundamental solution (28) for z > 0. 
But the boundary equation (30b) introduces the contraint gs{x,y,0) = 
f{x,y). Thus, the monogenic extension /m(®) is a specific solution of 
(30) which in fact can be computed by convolving the function f{x, y) 
with a set of operators derived from the fundamental solution. 

The components of g{x) are derivations of p{x)e 3 when z > 0, 

hp{x) = ^ (31a) 



hcPx{x) 



ei-^p{x)e3 



x 

o I 13 ^31 



(31b) 



hcPy{x) 



e 2 -^p{x)e 3 

dy 



O I 13 ^23- 

2 tt\xC 



(31c) 



These functions are well-known in Clifford analysis as the Poisson kernel 
(31a) and the conjugate Poisson kernels (31b and 31c), respectively. 
While hp = g ■ 63 , the expressions (31b) and (31c) can be summarized 
by hcp = hcpx + hcpy = g A 63 , 

{xei+ye2)e3 

= 2,|x|3 ^ 

Their Fourier transforms are 



Hp{u,v,z)= exp(— 27r|ttei -|- ue 2 | 2 ;) 



(33a) 



Hcp{u, V, z) = Hp{u, v) exp(-27r|ttei -|- ve 2 \z), 



where 



Hr{u,v) 



( U(t3\ - U623 Y _ Uei + VB2 

\\uei + ve2\J \uei + ve2\ ^ 



(33b) 



(34) 



is the Riesz transform, which is the generalization of the Hilbert trans- 
form with respect to dimension. Note that in equation (34) M* = 
means the dual of the multivector M with respect to the pseudoscalar 
I 3 of M 3 . Equations (31) formulate three harmonic conjugate functions 
which form a Riesz triple of operators in the half-space z > 0. Obvi- 
ously the Riesz transform does not depend on the augmented coordinate 
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z. But equations (33) can be interpreted in such a way that z is a damp- 
ing parameter in the frequency domain for all components of the Riesz 
triple. In fact, 2 : must be interpreted as a scale parameter of a linear 
scale-space, called Poisson scale-space. 

Here we must make a remark with respect to the Fourier transform 
used in this approach. Although the signal embedding is given in a 3D 
space, the Fourier transform is 2D. Actually we are using an isotropic 
transform to correspond to the model of point symmetry. Hence, for a 
given function f{x) the complex valued isotropic Fourier transform is 

F{u,v) = JJ f{x) exp(— /327rri • x) dxdy. (35) 

Thus, the harmonics we consider are actually spherical ones. This impor- 
tant feature, together with the conception of point symmetry to realize 
the signal decomposition into even and odd components, overcomes the 
restrictions of the QFT of missing rotation invariance. 

In the following we will focus on the result of the fundamental solution 
of the Laplace equation for z = 0. Besides, we will consider the scale 
as parameter of the representation, hence, x = {x,y), and s is the scale 
parameter. 



2.3.2 The Monogenic Signal. Let f]<^^ be the monogenic signal 
[Felsberg and Sommer, 2001] of a real 2D signal f{x, y). In the plane s = 
0 the monogenic signal is composed by the es-valued representation 
f{x) = f{x,y)e^ and its Riesz transform, fp>(x), 



fM{x) = f{x) + fn{x). (36) 

The monogenic signal in the frequency domain, u = {u,v), 

F m{u) = F{u) + Fr{u), (37) 

is computed by 

Fm{u) = (1 + Hr{u)) F(u) =(^1 + ^^2 F{u). (38) 

In the spatial domain the Riesz transform results from convolving f with 
the Riesz transform kernel Hr, 



thus. 



hR{x) 



xes 
27r|a;P ’ 



(39) 
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x')dx'dy' . 
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(40) 




322 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



The Riesz transform generalizes all known Hilbert transform properties 
to 2D. In addition, the monogenic signal can be used to compute local 
spectral representations. Now the local magnitude is isotropic. 

The local spectral decomposition of the monogenic signal in 

/m(®) = I/m(®)I exp(arg(/^(®)e3)) (41) 

is given by the real local amplitude 

I/m(®)I = exp(log(|/M(®)|)) = exp((log(/M(®)e3))o) (42) 

and the local rotation vector 

r{x) = aigif^ix))* = (log(/M(®)e3))2, (43) 

see [Felsberg, 2002]. The local rotation vector r(x) lies in the plane 
spanned by ei and 62. Hence, it is orthogonal to the local amplitude 
vector \fj^(x)\e 3 . In the rotation plane the rotation vector is orthogonal 
to the plane spanned by f and 63. The length of the rotation vector 
is coding the angle (p between and 63 which is the local phase of the 
2D signal, 

(/^(a;) = sign(r • ei)|r|. (44) 

The orientation of the plane spanned by f and 63 expresses the local 
orientation of the 2D signal in the image plane, 9{x). 

Local phase and local orientation are orthogonal features expressing 
structural and geometric information in addition to the energy informa- 
tion represented by the local amplitude. 

Hence, from the chosen 3D embedding of a 2D signal we obtain a more 
complex phase representation than in the ID case. It includes both the 
local phase and the local orientation of a structure. But it is limited 
to the case of ilD-2D structures. Nevertheless, the result is a consis- 
tent signal theory for representing lines and edges in images which tells 
apart the difference between the embedding dimension and the intrinsic 
dimension of a signal. A practical consequence is that steering of the 
orientation in that approach is unnecessary for ilD signals. 

The phase decomposition of a monogenic signal expresses symmetries 
of a local ilD structure embedded in M^. There is an interpretation 
of the local phase other than that given by a rotation vector of a M]|~- 
valued signal in equation (43). Instead, the decomposition 

into local phase and orientation can be understood as specification of 
a rotation in a complex plane which is oriented in M^. In that case, 
for a given orientation 0, the well-known phase interpretation of a ID 
signal (equation (8b) can be used. The embedding into M3 supplies the 
orientation of the complex plane. 
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2.3.3 The Poisson Scale-Space. So far we have discussed the 
derivation of local spectral representations, including local orientation, 
from the monogenic signal for the case of vanishing damping by the 
Poisson kernel (31a) and conjugate Poisson kernels (32), respectively. 
The same interpretation applies of course for all other scale parameters 
s > 0. 

Because the Poisson kernel can also be derived for ID signals by solv- 
ing the Laplace equation in M 2 , there exists a similar scheme for that 
case [Felsberg, 2002], 

The fact that {x; s} with s being the Poisson scale parameter rep- 
resents a linear scale-space [Felsberg, 2002] is surprising at first glance. 
There exist several axiomatics which define a set of features a scale- 
space should have. The first important axiomatic of scale-space theory 
proposed by lijima [lijima, 1959] excludes the existence of other scale- 
spaces than that one derived from solving the heat equation, that is the 
Gaussian scale-space. In [Felsberg and Sommer, 2003] the reason for 
that wrong conclusion could be identified. 

As set out in subsection 2.1 the allpass characteristics of the Hilbert 
transform and also of the Riesz transform hinders their application in 
image analysis. Instead, there is a need for quadrature filters which 
are in fact bandpasses. By using the Poisson kernel and its conjugate 
counterparts (equations (33)), it is possible to design bandpasses by 
building the difference of Poisson filters [Felsberg, 2002]. These are ab- 
breviated as DOP filters and DOCP filters, respectively. In order to 
match symmetry they are either even (DOP) or odd (DOCPs). As 
hp,hcpx and hcpy these three bandpasses also form a Riesz triple. 
The set of filters is called a spherical quadrature filter. Interestingly, 
DOCP = DOCPx + DOCPy is an odd and isotropic operator which 
could not be designed in another way. 

The Poisson scale-space is not only new, but its conception estab- 
lishes a unique framework for performing phase-based image processing 
in scale-space. Hence, local phase and local amplitude become inher- 
ent features of a scale-space theory, in contrast to Gaussian scale-space 
[Felsberg and Sommer, 2003]. In [Felsberg and Sommer, 2003] there is 
a first study of the properties of the monogenic scale-space. 

2.3.4 The Structure Multivector. The monogenic signal only 
copes with symmetries of ilD structure in a rotation invariant manner 
and enables one to estimate the orientation in addition to phase and 
amplitude. How can this approach based on solving the Laplace equation 
be extended to i2D structure? In a general sense the answer is open yet. 
This results from the fact that in 2D there exist infinitely many different 
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symmetries. Because the monogenic signal is derived from first order 
harmonics as a transfer function of the Riesz transform, it follows that 
an increase of the order of harmonics to infinity would be necessary to 
cope with an arbitrary i2D structure. Hence, from this point of view 
we get stuck in a complexity problem similar to other knowledge-based 
approaches of designing pattern templates as filters. 

Nevertheless, a first shot is given by the approach of the structure 
multivector [Felsberg and Sommer, 2002]. Here the first spherical har- 
monics of order zero to three are used to design a set of filters. This set 
implicitely assumes a model of two perpendicularly crossing ilD struc- 
tures, thus representing in our approach a simple template of a special 
i2D structure [Felsberg, 2002]. 

Let be the impulse response of a spherical harmonic filter of order 
i. Then hPg{x) = 5{x) and h\{x) = hji{x). If f{x) G M 3 , then 

S{x) = f{x) + {hi * f){x) + e 3 (/i| x= f){x) + esihl * f){x) (45) 

is a mapping of the local structure to a 7-dimensional multivector, 

S{x) = So -h Siei -h S 2 C 2 -h 3363 -h 523623 + •531631 + S 12612 , (46) 

called the structure multivector. 

That response actually represents a special i2D generalization of the 
analytic signal. Hence, a split of identity of any i2D signal, projected to 
the model, can be realized in scale-space. The five independent features 
are local (main) orientation, two local ilD amplitudes and two local ilD 
phases. A major amplitude and a minor amplitude and their respective 
phases are distinguished. The occurrence of a minor amplitude indicates 
the i2D nature of the local pattern. For details the reader is referred to 
[Felsberg, 2002, Felsberg and Sommer, 2002]. 

The filter can be used to recognize both ilD and i2D structures, but in 
contrast to other filters which respond either to ilD or to i2D structure 
or mix the responses in an unspecific manner, this filter is specific to 
each type of structure. 

3. Knowledge Based Neural Computing 

Learning the required competence in perception and action is an es- 
sential feature of designing robot vision systems within the perception- 
action cycle. Instead of explicitly formulating the solution, implicit rep- 
resentation of knowledge is used. This knowledge concerns e.g. equiv- 
alence classes of objects to be recognized, actions to be performed or 
visuo-motor mappings. 

There are plenty of different neural architectures. But most of them 
have in common that they are general purpose or universal comput- 
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ing architectures. On the other hand we know that some architecture 
principles are more useful for a given task than others. 

If the designer is aware of the specific features of his problem, he 
may integrate domain/task knowledge into the very architecture of neu- 
ral computing. We will call this knowledge based neural computing 
(KBNC). 

We will focus here on algebraic approaches which are tightly related 
to the geometry of data. Our choice is to take geometric algebra as 
an embedding framework of neural computing. We developed a general 
scheme of embedding neural computing into Clifford algebras in the sense 
of a knowledge based approach [Buchholz and Sommer, 2000b, Buchholz 
and Sommer, 2001b, Buchholz and Sommer, 2001a]. As an outcome 
we could propose several special neural processing schemes based on 
using geometric algebra [Banarer et ah, 2003b, Buchholz and Sommer, 
2000a, Buchholz and Sommer, 2000c] . In our approach we are capturing 
higher order information of the data within a single neuron by exploiting 
the special multivector structure of a chosen algebra. Because we get 
access to the chosen Clifford group, we are thus able to learn geometric 
transformation groups for the first time. 

Neural learning of a model can be understood as an iterative non- 
linear regression of a function to a set of training samples. By embed- 
ding the fitting of a model to given data into geometric algebra, we use 
algebraic knowledge on the nature of the problem to constrain the fit by 
the chosen algebraic rules. Furthermore, the chosen embedding causes 
a transformation of the function from a non-linear type in Euclidean 
space to a linear one in the special Clifford domain. This is a principle 
of minimal efforts which is called Occam’s razor in statistical learning 
theory. Thus, learning the linear function in geometric algebra will be 
simpler than learning the non-linear one in vector algebra. 

3.1 The Clifford Spinor Neuron 

3.1.1 The Generic Neuron. In this section we will restrict 
ourselves to the algebraic embedding of neurons of perceptron type ar- 
ranged in feed-forward nets. Next we will summarize the basic ideas of 
computing with such neurons. 

Let us start with a so-called generic neuron whose output, y, reads 
for a given input vector x and weight vector w, both of dimension n, as 

y = g{f{x;w)). (47) 

The given neuron is defined by a propagation function /: — > D and 

by an activation function g: D — > D' . In case of a real neuron with 
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D = 'S. and w,x G M”, the propagation function 

n 

f{x) = '^WiXi + 9, (48) 

i=l 

where 0 G M is a threshold, obviously computes the scalar product of 
w and X. Because of the linear association of weight and input vectors, 
(48) is also called a linear associator. A neuron of that type with g being 
the identity operation is also called an adaline. By applying a non-linear 
activation function g to f, the neuron will become a non-linear comput- 
ing unit called a perceptron. While an adaline may be interpreted as 
iteratively realizing a linear regression by learning, a trained perceptron 
enables the linear separation of two classes of samples. This classifica- 
tion performance results from the fact that the trained weight vector is 
perpendicular to a hyperplane in input space. 

For the training of the generic neuron a supervised scheme can be 
used. That is, there is a teacher who knows the required answer, r* G M, 
of the neuron to a given input x^ G M”". Hence, the training set is 
constituted by m pairs (®*,r*). Then the aim of learning is to find that 
weight vector w which minimizes the sum of squared error (SSE) 

- m 

(49) 

i=l 

This optimization is done by gradient descent because the weight cor- 
rection at each step of the iterative procedure is given by 

dE , , 

where 77 > 0 is a suitable learning rate. 

Because in a net of neurons the error has to be propagated back from 
the output to each neuron, this is also called back-propagation. 

Finally, we will give a short sketch of combining several neurons to a 
(feed-forward) neural net. By arranging p (real) neurons in a single layer 
that is fully connected to the input vector x, we will get a single layer 
perceptron network (SLP). If (7 is a non-linear function, the output vector 
y represents in each of its p components yi a linear discriminent function 
in input space. By taking at least one of such layers of neurons and 
hiding it under an output layer, we will get a hierarchical architecture 
of neurons, which is called multilayer perceptron (MLP) architecture. 
The output layer is composed of at least one neuron which computes the 
superposition of the neurons of the preceding layer according to equation 
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(47). Hence, the MLP may be used either as a non-linear classification 
unit or for approximating any non-linear function. 

If, on the other hand, g is the identity, the SLP computes an ordinary 
matrix multiplication. 



y = Wx, (51) 

where W is the weight matrix containing the single weight vectors. If 
W is square {x,y G W^), then it represents a linear transformation of 
the input vector. 

3.1.2 The Clifford Neuron. Now we will extend the model of 
a real valued generic neuron to that of a Clifford valued one. We will 
neglect for the moment the activation function. By replacing the scalar 
product of the linear associator by the geometric product of an algebra 
^P,q^ p + q = n, we are embedding the neuron into according to 

f : Mp,g — > Rp^q. (52) 

Hence, for x,w,Q G the propagation function is 

f{x) = wx + 0 (53a) 

or 

f{x) = xw + 0, (53b) 

respectively. Note that the geometric product with respect to Mp^g 
now has to be computed. The splitting of the propagation function 
into the two variants of equation (53) follows obviously from the non- 
commutativity we have to assume for the geometric product of Mp,g. 

Having a training set T: ={(£c^,r^), ... , (a;™', r™')|a;*, r* G Mp,g}, the 
weight corrections 

Aw = ®*(r* — wx^) (54a) 

for left-sided weight multiplication and 

Aw = (r* — x^w)x^ (54b) 

for right-sided weight multiplication enable a Clifford neuron to learn in 
a similar manner as a real one. Here x means the conjugate of x in the 
Clifford algebra. By taking this involution, the appearance of divisors 
of zero during the gradient descent is prevented [Buchholz and Sommer, 
2001b, Buchholz and Sommer, 2001a]. 
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What is the benefit derived from computing (53) instead of (48)? We 
give an illustrative example. Let x = x\ + 1 x 2 and y = y\ + iy 2 be 
two fixed complex numbers, x,y ^ C with C ~ Mo,i — task 

of a complex neuron shall be learning the mapping /: x — > y. This 
corresponds to learning a weight w G C so that wx = y, that is to learn 
a complex multiplication. This is in fact a simple task for the complex 
neuron. If instead real neurons should do the same, there is a need of 
a SLP with two neurons to compute yi and y 2 , respectively. According 
to (51) the task is now to learn a weight matrix W G K(2), which 
satisfies W{x\,X 2 )'^ = ( 2 / 1 , 2 / 2 )^ with x,y G M^. Here W represents 
a linear transformation of the vector x. The SLP has to find out the 
constraints on W which correspond to the matrix representation of a 
complex number. These constraints are obviously ten = W 22 and W 12 = 
—W 2 i- As shown in [Buchholz and Sommer, 2001b] the SLP converges 
slower than the complex neuron. It is obviously better to use a model 
than to perform its simulation. Another advantage is that the complex 
neuron has half of the parameters (weights) to learn in comparison to 
the SLP. 

Both of these observations can be generalized to any Clifford neuron in 
comparison to a corresponding SLP. Because of the M-linearity of Clifford 
algebras [Porteous, 1995], any geometric product can be expressed as 
a special matrix multiplication. Hence, by choosing a certain Clifford 
algebra it is not only that a decision is made to use an algebraic model, 
but statistical learning will become a simpler task. 

3.1.3 The Clifford Spinor Neuron. There are additional ad- 
vantages to using this approach. Next we want to extend our model. 
The above toy example gives us a hint. The SLP has to find out that 
W is indeed representing an orthogonal transformation. Because each 
special orthogonal matrix W G M(2) is a rotation matrix, W rotates x 
to y in M^. Therefore, in the case of the complex neuron the equation 
wx = y can be interpreted as mapping x to y, x,y G C, by the complex 
number w. But now w is no longer a point in the complex plane but 
the geometric product of w represents a special linear transformation, 
namely a rotation-dilation, see [Hestenes, 1993]. The representation of 
such an orthogonal operator in geometric algebra is given by the sum of 
a scalar and bivector components and is called a spinor, S. In general, 
a spinor performs a dilation-rotation in the even subalgebra of any 
geometric algebra 

Hence, the complex neuron learns a spinor operation, which is per- 
forming an orthogonal transformation in C. But this is no longer true 
for our simple model of the Clifford neuron introduced in equation (53) if 
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we choose vectors x,y G The spinors of M 3 are quaternions because 
of the isomorphism Mj^ ~ H. To perform a rotation in M^, which maps 
vectors to vectors, the two-sided spinor product 

<t{x) : X — > SxS (55) 

has to be used in (53) instead. In this equation, the right half of the 
multiplication is performed with the inverse parity conjugated spinor, 
S . The geometric product is now the quaternion product because of 
the isomorphism Mj]'q ~ Mo, 2 - Instead of the vector ® G M^, we have to 
use its representation as a (pure) quaternion, x G Mo, 2 - 

The greatest advantage of embedding a geometric problem into geo- 
metric algebra is to profit from such linear realization of group actions. 
Another advantage is that the group members, and the set of elements 
the group members are acting on, belong to the same algebra. 

Because in our context the spinors are representing weights of the 
input of a neuron, we will continue using the notation w instead of S. 
The propagation function of a general Clifford spinor neuron, embedded 
into M+g, is 

f{x) = wxw~^ + 0. (56) 

Such a neuron computes an orthogonal transformation by using only one 
weight, w. Only in the case of a spinor product do we get the constraints 
necessary to use only one neuron with one weight for computing an or- 
thogonal transformation. Obviously, the general Clifford spinor neuron 
has only half as many parameters (and half as many arithmetic opera- 
tions) as the general Clifford neuron because only the even components 
of the weight multivector are used. But special care has to be taken to 
use the right coding of input and output data [Buchholz and Sommer, 
2001 b]. 

Now we will complete the neuron model by considering the activation 
function g, see equation (47). There is a need for generalizing an activa- 
tion function on the real domain to the most popular sigmoid function 

gg{u) : M — > M; u — > (1 -|- exp {—j3u))~^ (57) 

on a Clifford valued domain. So far there is no easy way to formulate 
such a generalization [Georgiou and Koutsougeras, 1992]. Therefore, we 
use a component-wise activation function 

g{u) = g{[u]i), i G {0,...,2”“^} (58) 

which is operating separately on all 2 ”,n = p + q-, components of Mp^g. 
This was first proposed by Arena et al. [Arena et ah, 1997] for the 
quaternionic case. 
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The construction of a Clifford MLP (CMLP) or of a Clifford Spinor 
MLP (CSMLP) by following the principles of a real MLP is straightfor- 
ward. To formulate a generalized backpropagation algorithm [Buchholz 
and Sommer, 2001a] is not a problem. 

In principle real MLPs, CMLPs and CSMLPs have the same theo- 
retical strength because all are universal approximators. Because they 
also use the same activation function, any potential advantage with re- 
spect to generalization of using the embedding of neural computation 
into geometric algebra should be based on the geometric product of the 
propagation function. 

3.1.4 Learning a Euclidean 2D Similarity Transformation. 

The task is to learn the plane Euclidean similarity transformation 
composed by a 2D rotation with ip = —55°, a translation with t = 
(-|-1, —0.8) and a dilation by the factor d = 1.5 [Buchholz and Sommer, 
2001b]. 

Figure 1 shows both the training data (a) and test data (b), respec- 
tively. We applied a SLP with two real neurons and four weights, one 
complex neuron and one complex spinor neuron. While the complex 
neuron has two weights, the spinor neuron has only one. Figure Ic 
shows the convergence of the three computing units during training. As 
we see, the complex neuron converges faster than the real neurons. The 
reason for that behaviour has been discussed above. The spinor neuron 
needs more training steps for learning the constraints of the spinor prod- 
uct. But after 60 epochs its error is the lowest one of all three models. 
The spinor neuron learns a spinor representation, which is indicated by 
the successive convergence of the odd components of the propagation 
function to zero. 

Generalization with respect to learning of a function means keeping 
the approximation error low if the data have not been seen during train- 
ing. The behaviour of the neurons applied to the test data should be 
comparable to that for the training data. In the case of a KBNC ap- 
proach, the learning of the model should also be robust with respect 
to distortions of the model in the data caused by noise. We overlayed 
both the training and test input data with additive median-free uniform 
noise up to a level of 20 percent. The mean square errors of the out- 
puts are shown in figures Id and le. Both the Clifford neuron and the 
Clifford spinor neuron are slightly distorted in learning the model. But 
their results with respect to the test data in comparison to training data 
indicate a good generalization. There is no significant difference in the 
behaviour of both Clifford neuron models with respect to generalization. 
This is what we expect in the case of complex algebra. The result for 
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Figure 1. Learning of a plane Euclidean similarity transformation. Upper row: left 
(a): training data; right (b): test data; middle (c): convergence of the learning; lower 
row: approximation error; left (d): training errors; right (e): test errors. 



the Clifford neurons contrasts with the performance of the real neurons. 
These are much less distorted during training but much more in the case 
of test data in comparison to the knowledge based neurons. 

Because real neurons are learning a general linear transformation, they 
are able to learn noise better than the Clifford neurons. This causes 
both the lower errors for training data and the higher ones for test data. 
Because the real neurons have no constraints to separate the intrinsic 
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properties of the Euclidean transformation from those of the noise, they 
learn a transformation which only barely copes with new data. This 
indicates a bad generalization. 

Both models of the Clifford neuron are constrained by the involved 
algebra to separate the properties of the geometric transformation from 
those of the noise. They are hindered to learn noise. Therefore, their 
results are worse than those of the real neurons in the case of training 
data but better in the case of test data. 

3.2 The Hypersphere Neuron 

Here we will present a special neural architecture which is based on 
the conformal geometric algebra. This algebra creates a non-Euclidean 
model of Euclidean geometry with the remarkable property of metrical 
equivalence [Dress and Havel, 1993]. This embedding of a Euclidean 
vector space results in a hypersphere decision boundary which, in the 
used embedding space, is a hyperplane and thus can be learned by a 
perceptron-like neuron. 

3.2.1 The Homogeneous Model of the Euclidean Space. 

It has been proposed by Hestenes in [Hestenes, 1991] that the conformal 
group C{p,q) of can be elegantly constructed in in the 

framework of geometric algebra by applying the outer product. A, with a 
unit two-blade, E:= eAeo = e_|_ Ae_. This operation is called conformal 
split. It transforms the conformal group into a representation which is 
isomorphic to the orthogonal group 0{p -|- 1, (7 -|- 1) of In [Li 

et ah, 2001] Li et al. worked out in detail the method of representing 
the conformal geometry of the Euclidean space M” in Minkowski space 
respectively in the conformal geometric algebra 

The construction 0 0 being the direct sum, uses 

a plane with Minkowski signature whose basis (e+,e_) with = 1 , 
= — 1 augments the Euclidean space M” to realize a homogeneous 
stereographic projection of all points x G M"" to points x G gee 

[Rosenhahn and Sommer, 2002]. By replacing the basis (e+,e_) with 
the basis (e, cq) in the Minkowski plane, the homogeneous stereographic 
representation will become a representation as a space of null vectors. 
This is caused by the properties of the new basis vectors. These are 
related to the former ones by e = e_ -|- e+ and eg = 5 ( 6 - — e+) with 

= Cq = 0 and e ■ eq = —1. 

Any point x G transforms to a point x G 

1 2 

X = X + -X e + eo 



( 59 ) 
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with = 0. 

Hence, points of the Euclidean space are represented by null vec- 
tors in the 5-dimensional space with Minkowski signature. Actually, 
they are lying on a subspace of called horosphere, N^. The horo- 
sphere, which is a cut of the null cone with a hyperplane, see [Li et ah, 
2001], with the remarkable property of being a non-Euclidean model of 
Euclidean space, has been known for a long time, see [Yaglom, 1988]. 
But only in geometric algebra is there a practical and relevant approach 
to exploit this powerful non-Euclidean representation in engineering ap- 
plications. The horosphere AT is metrically equivalent to a Euclidean 
space M”. It is called the homogeneous model of Euclidean space, since 
points in are represented in generalized homogeneous coordinates. 



3.2.2 Construction and Properties ofthe Hypersphere Neu- 
ron. Being metrically equivalent means that there exists a correspon- 
dence between the distance d{x,y) in and the distance d{x,y) on 
the horosphere Nff of With d{x,y) = \x — y\ = {x — yY this 

mapping reads d{x,y) = —^d‘^{x,y), see [Dress and Havel, 1993]. Given 
two points x,y G M”, their distance d{x,y) is computed simply by the 
scalar product x y in Hence [Li et ah, 2001], 



= ( 60 ) 

Any point c G can be interpreted as a degenerate hypersphere s 
with center at c and radius r G M equal to zero, 

s = c+^{c^-r‘^)e + eo. (61) 



A point X lies on a hypersphere if — cj = jrj. The general relation 
of a point x and a hypersphere s may be described by evaluating their 
distance 



Then the distance of a point, represented by the null vector x, to a 
hypersphere s G will be 



d{x,s) : { 



> 0 if ® is outside s 
= 0 if ® is on s 
< 0 if a; is inside s 



(63) 



That distance measure is used for designing the propagation function 
of a hypersphere neuron [Banarer et ah, 2003b]. If the parameters of 
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the hypersphere are interpreted as the weights of a perceptron, then by 
embedding any data points into decision boundary of the 

perceptron will be a hypersphere. Because a vector x G without 

an cq— component represents a hypersphere with infinite radius, that is 
a hyperplane, the hypersphere neuron subsumes the classical perceptron 
as a special case. 

The implementation of the hypersphere neuron uses the equivalence 
of the scalar products in g,nd li x,y G and x,y G 

then X ■ y = X ■ y. Therefore, we use the following coding 
for a data vector x G M”", x = {x\, Xn), embedded in x = 

(xi, ...,Xn, —1, —^x"^). The coding of the hypersphere s G repre- 
sented in is given by s = (ci, — r^), 1). As a result of 

that embedding, a hypersphere in M” is represented by a hyperplane in 
]^n+ 2 _ niaps the hypersphere neuron to a perceptron with a second 
bias component. 

In the above coding, the components of the hypersphere are considered 
as independent. This makes the hypersphere unnormalized and enables 
one to control the assignment of the data either to the inner or to the 
outer class of a 2-class decision problem, see [Banarer et ah, 2003b]. This 
is of special interest because the effective radius of the hypersphere will 
be influenced by the parameter (3 of the sigmoidal activation function, 
see equation (57), which is used to complete the neuron. If we remember 
the interpretation of points in g^g g degenerate hypersphere, we are 

able to assign to the data points a confidence measure by extending the 
points to hyperspheres with imaginary radius. The confidence attributed 
to a data point is 

^CONF = X + '’’“conf)^ + ®o- (64) 

From the scalar product between the hypersphere s and Xqq^^p, 

£ • Xqqj^p = -(r^ — {{x — c)^ -I- Vconf))-! (65) 

follows a shift of the effective distance of the point to the hypersphere 
during training of the neuron. This leads to an adaption of the classifi- 
cation results to the confidence of the data. 

3.2.3 The Performance of the Hypersphere Neuron. The 

hypersphere neuron is an elemental computing unit of a hypersphere 
single-layer perceptron (SLHP) or multi-layer perceptron (MLHP), re- 
spectively. Its superior performance in comparison to the classical SLP 
or MLP, respectively, can be demonstrated with respect to several bench- 
mark data sets [Banarer et ah, 2003a]. 
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Figure 2. Classification of three toy objects. Visualization of the decision surfaces 
and data sets of the three first principal components. 

In figure 2 we show an example from real world data. The problem is 
to recognize three different toy objects in a rotation invariant manner. 
For each of the 360 data sets (images with objects rotated each by 1°) 
a principal component analysis (PCA) was performed. Finally, the first 
three eigen-images of each class, each transformed to a data vector of 
length 1225, have been used for classification. Two neurons of a MLHP 
have been sufficient to separate the three classes. Obviously, the neuron 
model used is adequate for the problem. 
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1. Introduction 

This paper describes some key mathematical ideas in the theory of 
radar from a group theoretic perspective. The intention is to elucidate 
how radar theory motivates interesting ideas in representation theory 
and, conversely, how representation theory affords a better understand- 
ing of the inherent limitations of radar. Although most of the results 
presented here can be found in (Wilcox, 1991) and (Miller, 1991), there 
are significant differences in the selection and presentation of material. 
Moreover, compared with (Wilcox, 1991), (Miller, 1991) and (Moran, 
2001), greater emphasis is placed here on the group theoretic approach, 
and in particular, its ability to arrive quickly and succinctly at basic 
results about radar. 

Central to radar theory is the ambiguity function. Specifically, corre- 
sponding to any waveform w(t) is a two dimensional function A^(t, /), 
called the ambiguity function, which measures the ability of that par- 
ticular waveform to allow the radar system to estimate accurately the 
location and velocity of the target. Some waveforms perform better 
than others, and it is the challenge of radar engineers to design wave- 
forms with desirable ambiguity functions while simultaneously meeting 
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the many other design criteria which impose constraints on the set of 
feasible waveforms. 

A particularly elegant way of studying the ambiguity function is to 
write it in the form A^{t, f) = {^, P[t,f)'^)c(R) where P[t,f) is an op- 
erator acting on L^(R). In fact, P(tj) turns out to be a very special 
type of operator; it is an irreducible unitary multiplier representation 
of the additive group R^. It is here that group representation theory 
enters the picture. Functions A : G ^ C of the form A{g) = (w, p^v) 
over some group G, where pg is a representation of G, are sometimes 
known as special functions in the literature. Importantly, most if not all 
interesting facts about special functions can be deduced from a study of 
the group representation pg. 

After defining the ambiguity function in Section 2, a brief introduc- 
tion to representation theory is presented in Section 3. A feature of 
this presentation is that multiplier representations, along with their con- 
nections to ordinary representations and projective representations, are 
highlighted. Whereas it is customary to study the ambiguity function 
via the representation theory of the Heisenberg group, this paper studies 
instead the relevant multiplier representation of R^. Although both ap- 
proaches are equivalent, the authors believe the multiplier representation 
approach is the more natural. 

Section 4 derives fundamental facts about the multiplier represen- 
tation theory of R^ and how it relates to the ambiguity function. A 
somewhat novel contribution is a generalised version of Moyal’s identity 
(Theorem 2), whose proof has a more direct and intuitive flavour than 
that of previous proofs of Moyal’s identity. Also covered in Section 4 
are the various ways of realising the representation P[tj) on different 
Hilbert spaces. These representations are equivalent to each other, but 
depending on the problem at hand, some spaces can be easier to work 
in than others. 

The results in Section 4 are applied in Section 5 to answer several 
questions about ambiguity functions. A generating function approach 
is provided for finding explicit formulae for the ambiguity functions of 
Hermites. It is proved that the Hermite waveforms have the distinguish- 
ing property of having rotationally symmetric ambiguity functions. The 
potential benefits of hypothetically being able to use multiple waveforms 
are touched upon too. 

Finally, Section 6 concludes by stating that the ambiguity function 
studied in this paper is the narrow band ambiguity function and is an 
approximation, albeit a good one in many situations, to what is known 
as the wide band ambiguity function. It is explained that the latter also 
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can be studied from a group theoretic perspective, but that such a study 
is not undertaken here. 

2. How a Radar Works 

At the simplest level a radar transmits a waveform w(f) which is then 
reflected from the scene. In fact, the waveform is modulated onto (that 
is, multiplied by) a much higher (carrier) frequency sinusoid. 

The reflection is picked up at the radar and processed to produce a 
picture of the scene. We assume for the purposes of this discussion that 
the radar is looking entirely in one direction. We can think of the scene 
for the moment as a collection of scatterers at various distances ry and 
moving with velocities vp relative to the radar. We measure distances 
in units of time — the time needed for light to travel that distance — 
and we measure the velocities in multiples of the speed of light. This 
prevents the proliferation of c’s (the speed of light) in the formulae. Thus 
the scene can be regarded as a linear combination 

scene{t) = '^Ck5{t - ry,v - Vk) (1) 

k 

where S{t,v) is the “delta” function — that is a point mass — at the 
origin of the “range-Doppler” (t, v) plane. The term Ck is the (complex) 
reflectivity of the scatterer. 

Before proceeding further we remark that because radar (unlike con- 
ventional light based viewing systems) is able to keep track of the phase 
of the signal, or at least phase changes in it, processing in radar is done 
in the complex domain. A complex waveform can be modulated onto 
a carrier by using a phase shift to represent the argument of the wave- 
form. The presence of the high frequency carrier and a stable oscillator 
producing the sinusoid means that the radar can detect phase shifts and 
is capable of producing a reasonably good approximation to the Hilbert 
transform of the return. As a result, radar engineers work in the com- 
plex domain and assume that their signals are complex, the argument 
corresponding to a phase shift in the carrier. 

The signal returned to the receiver — and for simplicity we assume 
that the transmitter and receiver are collocated — is then the convolu- 
tion of the transmit signal and the scene. That is, 

ret(t) = ^ Cfcw(t - (2) 

k 

where fc is the carrier frequency. The 2’s have appeared because we 
are considering the round-trip of the signal rather than the one-way 
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trip. The return is then (after stripping off the high frequency carrier) 
correlated with another (or the same) waveform v(t). The result is 

proc(t) = / ret(r)v(r — t) dr 

r . (3) 

= ^ Cfc / w(r - - t) dr, 

which is a linear combination over the individual scatterers, as one would 
expect. The expression 

J w(r — 2r)e^^*^-^‘''^v(r — t) dr (4) 

tells all about the way in which the radar behaves. Once we know this 
function and the scene, we can reconstruct the return. Of course this is a 
gross simplification but for the purposes of this paper it will be enough. 
It is important to mention that the key issue for the radar engineer is 
to do the inverse problem: given the return, how to extract the scene. 

Briefly, we mention that there is scope for extraction of information 
from multiple waveforms rather than the single waveforms discussed 
here. Even this situation can be treated as that of one very long pulse 
and the expression (4) continues to be relevant. 

By a change of variable and a renormalization of the units of mea- 
surement of the velocity, equation (4) becomes 

^w,v(T/) = y w(r)e^’"*-^^v(r - t) dr. (5) 

This is the radar ambiguity function, or at least one form of it, and is 
the focus of the study of this paper. 

At this point we impose some mathematical constraints on the situa- 
tion. The functions w and v are always assumed to be “finite energy” 
signals; that is, they belong to L^(R). It is clear then that the integrand 
is integrable and indeed, by the Cauchy-Schwarz Inequality, that 

|Aw,v(i,/)| < ||w||l2(r)||v||l2(r), (t,/)GR^ (6) 

for all v,w G L^(R). The expression 

/>(.,.n(v)(T) = e-“/-v(T-«) (7) 

is clearly significant in the theory of the ambiguity function. We note 
that P[t,f) is a unitary operator in L^(R), and that 

Aw,v(t,/) = (w,ppj)v)^ 2 (R) • 



( 8 ) 
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= e-2«/i^e-2«/2(r-ii)(v)(r -ti~ ts) (9) 

= e^”-^'*V(ii+t2,/i+/2)(v)M, 



that is, 

P{tl,fl)P{t2,f2) ~ ® ^ P{tl+t2,fl+f2)‘ (^9) 

This makes p a multiplier representation of with multiplier 

a((ti,/i),(t2,/2)) = e-2-*i/^ (11) 

because 

P{G+t2,h+f2) = 0-((^l>/l)>(^2,/2))/J(tiJi)P(i2,/2)- (12) 

Our aim is to make sense of these ideas and examine their consequences. 

3. Representations 

Since representations, or at least multiplier representations, play a 
role in the radar ambiguity function, we spend some time discussing 
them; first ordinary representations and then multiplier representations. 
The abstract theory of representations of groups goes as follows. Let 
G be a locally compact group. Such a group has a Haar measure me-, 
which is invariant under left translation: 

yheG, [ F{hg) dmeig) = f F{g) dmdg), (13) 

Jg Jg 

for any integrable function F on G. 

A (unitary) representation of G is a continuous homomorphism 

tt:G^U{9)) (14) 

into the unitary group of a Hilbert space It will be convenient to 
write the image of G G under this map as Tig instead of the more 
conventional Ti{g). Continuity means, in this case, that the maps 

9^{C,T^gO (15) 

are continuous for all The theory of unitary representations 

of locally compact groups, and of Lie groups in particular, is extensive. 
Here we focus on those (small) parts we need for the development of the 
theory of the radar ambiguity function. 
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Two representations vr and 0 of G on the Hilbert spaces Sj and R are 
equivalent if there is an isometry V : ^ M such that 

V7Tg = egV {geG). (16) 

In pictorial form, vr and 9 are equivalent if there exists a V such that 



V 



commutes. Without the isometry constraint, the operator V is called 
an intertwining operator of tt with 9. If every intertwining operator of 9 
with itself is a scalar multiple of the identity, 9 is said to be irredueible. 
This is equivalent to there being no non-trivial subspace of ^ invariant 
under the action of all of the operators 9g {g G G). The connection 
between invariant subspaces and intertwining operators (via the spectral 
theorem) which makes these two definitions equivalent is called Sehur’s 
Lemma. 

When the group G is compact every representation is a direct sum 
of irreducible representations and this decomposition is unique up to 
equivalence of representations. This is the content of the Peter-Weyl 
Theorem. Unfortunately, the groups of special interest in radar are not 
compact and it is not true for these groups that every representation 
is a direct sum of irreducibles. In any case, every representation is 
a direct integral of irreducibles; see (Mackey, 1976). In general this 
decomposition is not unique but, for the groups of major interest to us, 
uniqueness (almost everywhere) holds. 

As we have already seen, it will be necessary to consider multiplier 
representations as well as ordinary representations. For this, we need 
a multiplier. This is a Borel map a : G x G ^ T, where T is the 
group under multiplication of complex numbers of absolute value 1 , that 
satisfies the eoeyele condition 

o-{gi,g2)(y{gig2,g3) = o-{gi,g2g3)o-{g2,g3) {gi,g2,g3 ^ G), (17) 

to which we add the normalization fj(l, 1) = 1. Then a multiplier rep- 
resentation is a map p : G ^ that satisfies 

Pqi 92 g2^ Pgi Pg 2 (ill; i/2 ^ ^); 












V 









(18) 
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and that is measurable in the same sense that an ordinary representa- 
tion is continuous. We remark that while the multipliers and multiplier 
representations considered in this paper are all continuous, it is impor- 
tant for the development of the general theory to permit multipliers 
and representations that are only Borel. The definitions of equivalence 
and irreducibility for multiplier representations are unchanged from the 
ordinary representation case. The projective unitary group is 



V{^)=U{^)/{cl:ceT). (19) 



Multiplier representations give rise to projective representations (that is, 
continuous homomorphisms from G to V^Sj)) via composition with the 
quotient map p : Conversely, since there is a Borel cross 

section rj : (that is, a right inverse of p) any projective rep- 

resentation gives rise to a multiplier representation. We note, however, 
that there are many such cross sections, and so there are many multi- 
plier representations giving rise to the same projective representation — 
indeed different multipliers are involved in general. 

The multipliers on a group G form an abelian group under multipli- 
cation. A multiplier (or cocycle) a is a coboundary if there is a Borel 
map j) : G ^ T such that 



(^{91,92) 



4>{9i)4’{92) 

4>{9i92) 



(20) 



Coboundaries form a subgroup of the group of cocycles of G and two 
elements of the same coset are said to be cohomologous. Cohomologous 
multipliers have essentially the same representation theory. If and cj 2 
are cohomologous: 



0-1(51,52) 



4>{9i)4'{92) 

4>{9i92) 



0-2(51,52), 



(21) 



and 5 is a di-representation, then a simple check reveals that 



5 ^ 4>{9)Pg 



(22) 



is a d2-representation. 

As we have stated, a projective representation k : G ^ lifts to 

different multiplier representations using different Borel cross-sections 
of the quotient map p. However different lifts just produce cohomol- 
ogous multipliers. If they are then made representations for the same 
multiplier using the trick described in equation (22), then the two repre- 
sentations are equivalent. Thus the theory of multiplier representations 
is essentially the theory of projective representations. 
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One final issue on the general theory of representations needs to 
be addressed before we return to radar theory. This is that projec- 
tive/multiplier representations are really ordinary representations but 
for a different group. Before justifying that we point out that if pro- 
jective representations did not exist (as the radar ambiguity function 
demonstrates they should), then we would have to invent them. There 
is a remarkable and beautiful theory for constructing representations 
of groups called “the Mackey analysis”; see (Mackey, 1976). To give a 
detailed exposition of this would take us well beyond the focus of this 
paper. We merely mention that, in order to use the Mackey analysis to 
construct ordinary representations of a group G, it is necessary to con- 
sider projective representations of subgroups of G. On the other hand, 
if we start trying to construct projective representations of G using the 
extension of the Mackey analysis to the projective case, then we still 
only have to consider projective representations of subgroups. Projec- 
tive representations form a natural completion of the class of ordinary 
representations . 

Now let p be a cr-representation of a group G. We show how to make 
p into ordinary representation of a larger group. We can form a new 
group G whose elements are pairs [g,z], where g ^ G and z G T, and 
with multiplication given by 

[gi,zi][g2,Z2] = [gig2,ziZ2a{gi,g2)]. (23) 

The identity element is (1 g, 1) and the inverse is given by 

[9,z]~^ = [g~^,za{g,g-'^)]. (24) 

Note that a consequence of the normalization it( 1, 1) = 1 is that 

O'CS',!) = ^^(1,5) = 1 {g(^G). (25) 

The group G is called a central extension of the group G by the circle 
group T. By this we mean that T (as the subgroup {[0,z] : z G T}) 
is a subgroup of the centre of G and the quotient G/T is isomorphic 
to G. Central extensions are classified by the group of cocycles modulo 
coboundaries. For details we refer the reader to (Maclane, 1975). 

Thus, for any fj-representation p of G, 

'^{[g^z]) =zpg (26) 

is an ordinary representation of G. There is an exact correspondence 
between the u-representation theory of G^ and the ordinary represen- 
tation theory of those representations of G that restrict on the central 
subgroup T to be the homomorphism [p, z] -^'zl. 
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4. Representations and Radar 

We return to the radar ambiguity function and how representation 
theory impinges on it. As we have seen in Section 2, the radar ambiguity 
function is expressible in the form 

Aw,v(t, /) = (w, P(t,/)v) , (27) 

where {t, f) P(t,f) is a cj-representation of and 

a((t,/),(t',/0) =e-2-/'*. (28) 

We explore some of the properties of this represention over the next few 
subsections. 

4.1 Basic Properties of the Ambiguity Function 

We can quickly deduce four fairly straightforward properties: 

Ambl For all u, v G L^(R), Au^v is in L^(R^) and || Au^v||l 2 (r 2 ) < 
I|u||l2(r)||v||l2(r). 

Amb2 The map (u, v) ^ Au^v is conjugate bilinear from L^(R) x L^(R) 
to L2 (r2). 

Amb3 Au,v{t,f) = e^’"*-^‘Av,u(-i, -/)• 

Amb4 For u, v G L^(R), 

f) = e^”-^*Au^v(/, -t). (29) 

Here, T denotes the Fourier transform. 

Proof: The proof of Ambl will be dealt with as part of a more specific 
theorem in Section 4.3. The conjugate bilinearity is clear, hence Amb2 
follows from Ambl. The proof of Amb3 is a simple calculation. First 
notice that 

4 ,/) = (30) 

in view of (10). It follows that 

Av,u(-T-/) = (v,P(_t_/)u) 

= (/5(_t,_/)V,u) 

= e2-^‘A„,v(t,/). 



(31) 




348 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



To deal with Amb4, we note first that, by the Plancherel theorem, 
(u,v) = (jFu,jFv) (u,v G L^(R)). 

Further, it is easy to check that 

Now it follows that 

f) = (J^U, j)J^v) 

It is remarked that Amb3 is a particular case of a more general 
formula applying to any cr-representation p: 

{u,pg^r) = a{g,g~^){^r,pg-lu) (5 G G, u,v G L^(R)). (35) 

4.2 Irreducibility 

The following result is really one of the two most important results in 
the theory of the radar ambiguity function. The other is Theorem 2. 

Theorem 1 The a -representation p is irredueihle and, moreover, it is 
the unique irredueihle a -representation of R^ up to equivalenee. 

Proof: The proof of the uniqueness of p would take us too far from 
the key aim of this paper. It is called the Stone-von Neumann Theorem 
and requires some deep ideas in representations of R. Even the proof 
of irreducibility relies on some results in harmonic analysis that go well 
beyond the scope of this paper. To prove irreducibility we use Schur’s 
Lemma. Suppose that B : L^(R) ^ L^(R) is an intertwining operator 
of p with itself. Then, in particular, it commutes with the translation 
operators 

Ph,o)(v)(r) = v(r-t). (36) 

Such an object B is known as a pseudo-measure. The only fact we need 
about pseudo-measures is that they have a Fourier transform B(f) such 
that 

^)(/) = %)9(/) (/gR), (37) 

and that this Fourier transform specifies a pseudo-measure uniquely. We 
refer the interested reader to (Katznelson, 1968) for details. Now we note 
that 



(32) 

(33) 

(34) 
□ 



p{oj')Mif) = v(/ + /0- 



(38) 
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Since B must commute with these operators too, its Fourier transform 
is translation invariant and so must be constant. It follows that B itself 
is a scalar multiple of the identity and so p is irreducible. □ 

4.3 Moyal’s Identity 

Here we establish a result that is one of the key special features of the 
representation p and the ambiguity function. Before we state the result, 
we establish some terminology and notation. The Hilbert space tensor 
product of two Hilbert spaces S) and R is the completion of the linear 
space of all finite formal sums 



^Ufc(g)Vfc, (39) 

k 



where and are in Sj and ^ respectively. The completion is with 
respect to the norm obtained from the inner product 






( 1 ) 

k 









( 2 ) 



( 2 )\ _ 



n' / = 



k' 



k,k' 









.( 1 ) ^( 2 )\ 
k /f)- 



(40) 



We assume there is a conjugate linear isometry J : ^ (in fact 

this will be conjugation in L^(R) in our context). Evidently, given any 
conjugate bilinear map B : Sj x 2, that satisfies 



^H(ufc,Vfc)||^< (41) 

k 



for any finite set {(u^, v^)} of elements of ij x .^, there is a continuous 
linear map B® : Sj 0 R ^ 2 such that the diagram 

10 J 

Sj X R — ^ Sj 0 R 




extends to a continuous linear map from (8> ^ £. We note that 

the tensor product of L^(R) with itself is just L^(R^). The following 
theorem is in essence Moyal’s identity. 



Theorem 2 The map (u, Jv) ^ ^u,v extends to an isometry from 
L^(R^) to itself. 
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Proof: It is easy to see that this map (for finite sums of simple tensors 
in the first instance) is just 

d>(F)(t, f) = j F{t, t - dT, (42) 

which, since we know that the Hilbert space tensor product of L^(R) 
with itself is just L^(R^), clearly satisfies the condition (41). If we write 
W : L^(R^) ^ L^(R^) for the map 

W{F){T,t) = F{r,T-t), (43) 

it is clear that it is an isometry of L^(R^) and that is just {I <SiF)oW 
where F is the one dimensional Fourier transform. The result is now 
clear. □ 

Corollary 3 For ui,vi,u2,v2 G L^(R), 

(^ui,vi,^U2,V2)l2(R2) = (ui,U2)l2(r)(vi, V2)l2(R). (44) 

It is equation (44) that is commonly referred to as Moyal ’s Identity. This 
has in turn another corollary. 

Corollary 4 For u, v g L^(R), 

II^v,v||l 2 (r 2 ) = ||v|| . (45) 

This is a form of the Heisenberg uneertainty prineiple. To explain its 
significance, we need to make a few remarks about the way radars nor- 
mally operate. When the return comes back into the receiver from a 
scene, noise is added by the receiver. This is just thermal noise in the 
electronic components. To a very good approximation this is white Gaus- 
sian noise — a stationary Gaussian random process in which restrictions 
to disjoint intervals are independent. Such a process (with finite energy) 
cannot exist mathematically, but as already stated, this is an approxima- 
tion. If one asks what waveform v when used for filtering in the receiver 
will maximize the signal-to-noise ratio, one can show that the correct 
choice is V = w. So it is normal (at least in theoretical discussions of 
radar) to assume that v = w — the so-called matehed filter. 

We note that by (3) the processed return is just the convolution of the 
range-Doppler scene with the ambiguity. Evidently then, so that we can 
extract the scene from the return, we would like Hv,v (or Hv,w) to be 
a delta function or close to it. Moyal’s identity makes this impossible. 
Clearly the value of Hv,v at the origin is just ||v|p and this is also its 
norm, so that it must have considerable spread. 
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As we shall see later, an important collection of waveforms (in the- 
ory but not in practice) are the Hermite functions. In fact it will be 
convenient for us to use a renormalized form of them. We write 

Vn(t) = 7T-V4(2«n!)-V2(_i)-e-tV2^^(i)^ (46) 

where is the nth Hermite polynomial, defined implicitly by the gen- 
erating equation 

V (47) 
n\ 

n 

The form an orthonormal basis of L^(R). It follows from Moyal’s 
identity that the collection {Av„,vm}nm therefore forms an orthonormal 
basis of L^(R^). 

4.4 The Symmetric Ambiguity Function 

In practical radar systems, the location (t, /) of a target is usually 
estimated by maximising the magnitude of the inner product between 
the received waveform v(r) and the adjusted version P(^j)(w)(r) of the 
transmitted waveform w(r). Here, / 0 (tj)(w)(r) corresponds to what 
would have been received in the ideal case had there been a single target 
at “distance” t moving at “velocity” /. Therefore, it is only the magni- 
tude of the ambiguity function, and not its phase, which provides useful 
information about the performance of the system. 

Let (j) : R^ ^ T be an arbitrary Borel map. Then the magni- 
tude of {y equals the magnitude of the ambiguity func- 
tion Moreover, the results of Section 3 imply that = 

f)p[tj) is actually a jv-representation of R^ for some multiplier v co- 
homologous to a. Therefore, what is of interest is not the properties of 
the particular ambiguity function Av^w{t,f), but rather the properties 
shared by all functions of the form (v, Since multiplier repre- 

sentation theory does not distinguish between cohomologous multipliers, 
this shows that radar theory really is eoneerned with the a -representation 
theory ofR?, no more and no less. 

Replacing a by a cohomologous multiplier v changes (the phase but 
not the magnitude of) the corresponding ambiguity function. Being able 
to work with different ambiguity functions can simplify calculations. 
The symmetric ambiguity function is now derived by considering the 
multiplier v defined by 









( 48 ) 
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This is cohomologous to a because 



-TTitf f' 






g-7Ti(t+t')(/+/') ■ 



(49) 



In fact, one can show that up to coboundaries and automorphisms of 
R^, this is the only cocycle on R^. Switching to this multiplier from a, 
we obtain an alternative formula for p, this time as a z^-representation. 
From the formulae (22) and (49), we obtain 

= e"*^V(i,/)(v)(r) = e-/(‘-2-)v(r - t), (t, /) G R^. 

(50) 

This results in the following form for the radar ambiguity function 



^S(T /) = / u(r + ^)v(r - dr. 



t s 



t , 



(51) 



/R 



This is referred to as the symmetric form of the ambiguity function. 

Note that because z/((t, /), (— t, — /)) = 1, we obtain from (35) the 
tidier form 

^S(i, /) = ^v,u(-t,-/) (52) 

of Amb3. In a similar vein, the new form of Amb4 is 



f) ~ ^ii,v(/, t). 






- zlH 



(53) 



4.5 The Heisenberg Group 

Using the ideas of Section 3 and the cocycle a corresponding to p, 
it is possible to form a central extension of R^ by T. The result is a 
group Go whose centre Zq is (in this case) isomorphic to T and for which 
Go/Zo is isomorphic to R^. It is customary instead to use the R- valued 
cocycle ur, given by 



= ft, 



(54) 



so that a = The result of applying the trick of equation (23) 

in this context produces a central extension of R^ by R, which is called 
the (3-dimensional) Heisenberg group 



(55) 




The corresponding representation of 7i is 



p{t, /, . 2 )(v)(r) = e2^*^e-2^*^^v(r - t). 



(56) 
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There is another extension (but isomorphic qua central extensions) 
formed using v, but we refrain from giving the details. In fact, we 
continue to present the theory of the radar ambiguity function in terms of 
multiplier representations of rather than the slightly more customary 
ordinary representation theory of the Heisenberg group. 

4.6 The Bargmann-Segal Representation 

While the Stone-von Neumann Theorem tells us that there is only one 
irreducible cj-representation of R^, or equivalently that there is only one 
irreducible representation of the Heisenberg group that restricts to the 
homomorphism 2 : ^ centre of TT, there are many equivalent 

ways of seeing this representation. The way we have discussed so far is 
called the Schroedinger representation of 7i. Here and in the following 
section we present two other ways of looking at this representation. 

It is shown in (Wilcox, 1991) that the Hermite waveforms v„(t), de- 
fined in (46), have ambiguity functions whose peak at the origin is as 
sharp as possible (in a sense made precise there). The Hermite wave- 
forms are also the eigenfunctions of a particular linear operator defined 
in Section 4.8. Since the Hermites appear to play a fundamental role, 
it is desirable to construct a cj-representation on a new Hilbert space 5^ 
such that the waveforms corresponding to the Hermites are as simple as 
possible. Such a representation is stated below. 

Consider the space 5 of entire functions a{z) = ^nz'^ satisfying 

|a„p < 00 . This is a Hilbert space with the inner product 



(a,b) = y^nlanbn- 

n 



(57) 



This is actually a reprodueing kernel Hilbert space, in that there exist 
elements e„, for any u £ C, such that 



(3-)0«) — a G 5"- 



Furthermore, the vectors 



in{z) 




n = 0, 1, • • • 



(58) 



(59) 



form an orthonormal basis for 5^. 
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We can explicitly write down an isometry 'I' : L^(R) ^ ^ which maps 
Vn(t) to ]n{z), namely 



(_xj 

= ^(u,v„)j„(z) 

(It. 



n=0 



I- 

Jr vt 



(60) 



Here, the second line is obtained with the help of (47). 

The isometry 'h maps the representation p to the equivalent repre- 
sentation = vl/ o p o 4'“^ on 5^. This is called the Bargmann- Segal 
a -representation of R^. It is given explicitly by 



P{K 



t -|- 2th f\ 



The derivation of (61) is postponed until Section 4.8. 



(61) 



4.7 The Lattice Representation 

The third equivalent version of the representation p is the so-called 
lattiee representation. It arises as an induced u-representation from the 
lattice subgroup of R^ or equivalently as an induced ordinary repre- 
sentation from the corresponding subgroup of the Heisenberg group 7i. 
The Hilbert space here is the space of all functions r : R^ ^ C satisfying 

r(x -|- n, 2/ -|- m) = e“^™^r(x, p), (62) 

and which are square integrable in the sense that 

1 

\r{x,y)\'^ dx dy < oo. (63) 

Note that, in view of (62), the integrand is periodic. This space has the 
obvious inner product. Now we can define a cr-representation p^ of R^ 
by 

p\tj) (r) {x,y) = e‘^^^^^r{x + t,y + f). (64) 

It is not hard to see that this is a u-representation. To prove irreducibil- 
ity we show directly that it is equivalent to p. The intertwining operator 
is known as the Weil- Brezin- Zaks transform and is given by 

OO 

Z[u]{x,y)= 

k=—oo 




e~^^i^yu{k - x) (ugL2(R)). (65) 
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It is straightforward to check Z[m\ satisfies both (62) and (63). Its inverse 
is given by 

Z“^[r](r) = / r{-T,y) dy. ( 66 ) 

Jo 

To check the intertwining property, we observe 

OO 

Z[p(,,^)(u)](x,y)=e2-/- V e-2-^(/+^)u(A:-x-t) 

(67) 

= /^(),/)(^[u])(a;,y)- 

Note that this forces Z to be a scalar multiple of an isomorphism and it 
is straightforward to see from this that it is indeed an isomorphism. 

Although not discussed here, there are links between the lattice rep- 
resentation and number theory. For instance, the Jacobi theta function 
appears in the expression for Z[vq](x, y), where vq is the zeroth Hermite 
waveform defined in (46). 

4.8 The Lie Algebra Representation 

Recall that the cj-representation p extends to a representation on the 
Heisenberg group TL. Since TL \s a, Lie group, the representation p in- 
duces a Lie algebra representation on the Lie algebra f) associated with 
TL. The Hermite polynomials naturally arise in this framework as the 
eigenvectors of a particular linear operator. 

The Heisenberg group has a Lie algebra f) which, as an additive group, 
is and has generators T, F, Z satisfying 

[T,F] = Z, [T,Z]=0, [F,Z]=0. (68) 

The representation p oi TL produces a representation, which we also 
denote by p, of the Lie algebra by unbounded operators on L^(R). In 
the case of p, the representations of the generators are 

p{F) = -2mt; p{T) = p{Z) = 2m. (69) 

It is easily checked that these satisfy (68). Let 






and observe that 



(71) 
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Its adjoint is 

We define N = p(A)*p(A) and remark that it is a self-adjoint operator. 
The normalized solution of p(A)(v) = 0 in L^(R) is 

vo(i) = (73) 

the zeroth Hermite function in (46). In fact, the eigenvalues of N are 
{0, 1, 2, . . .} and the corresponding eigenvectors of N are the v„ as 
defined in (46). Indeed, one can show easily that 

p(^)*(vn) = \/nTTv„+i 
p{A){vn) = \/nv„_i. 

The Bargmann-Segal representation stated in Section 4.6 is now de- 
rived. Define the Hilbert space as in Section 4.6 and let 'k be the 
isometry in (60). Deriving an expression for = \ko po directly 
is difficult, but it can be found indirectly by first calculating the corre- 
sponding Lie algebra representation as follows. 

Since p = o p®® o 'k, it follows that p = 47“^ o p^s q \J/ and p* = 
4/“^ o pBS o 47. Therefore, p(H)(vo) = 0 implies p^®(yl)(jo) = 0, that 
is, p^®(H)(l) = 0. Similarly, p(H)(v„) = yTlv„_i implies p^S(H)(z"') = 
nz^~^. Therefore, 

pSS(H) = (75) 

ctz 

or upon substituting for A, 

^(T) + ^^(F) = v/2j^. (76) 

2m dz 

It is readily shown that p(T) and p{F) are skew-adjoint, hence so are 
p^®(T) and p^®(F). That is to say, p^®(T)* = — p^^(T) and similarly 
for p^®(T). Therefore, taking the adjoint of (76) yields the new equation 

-p^(T) + ;^p^(F) = V2z (77) 

ZTTl 

where use has been made of the facts that the adjoint of ^ is z and 
the adjoint of a complex number is its complex conjugate. Solving the 
equations (76) and (77) gives 

• kS(F) = v/2„(L + ,). (78) 
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"'«>(“)(") = "(*■ + 7^) ■ (79) 

pfoj)(a)(2:) = exp + V2mfz^ a(^z + V27rif'^ . 

Equation (61) now follows upon noting that = P(o^/)7’(l^o)' 

4.9 The Metaplectic Representation 

Here we consider automorphisms of that preserve the structure 
we have discussed so far. In fact, it is more customary at this point to 
work with the equivalent multiplier u, defined in (48), rather than the 
multiplier a. We consider, then, the continuous automorphisms a of the 
group that preserve the multiplier w in the sense that 

These are just the members of SL(2, R) — 2x2 matrices with determi- 
nant 1. Evidently, for any such automorphism, o a is an irreducible 
i/-representation of R^ and so, up to equivalence, must be itself by 
the Stone-von Neumann Theorem. Thus there exists a unitary operator 
U (a) such that 

C/(a)-VS*,^^)C/(«) = (t, /) G R'. ( 81 ) 

Since p^'^^ is irreducible, U{a) is unique up to a scalar multiple. 

Moreover, U{af3) must have the same effect on as U{a)U{P) and 
so, by the irreducibility of p^'^\ these two must also differ by a multi- 
plicative constant. In other words, the map a U{a) is a projective 
representation of SL(2,R). In fact, the multiplier is cohomologous to a 

two-valued one. Thus, there is a double covering SL(2,R) of SL(2,R) 
and U lifts to a unitary representation of SL(2,R) on L^(R). This is 

called the metaplectic representation of SL(2,R). (The representation 
is not irreducible, but rather, L^(R) decomposes as the direct sum of 
two subspaces, where the representation restricted to either of the two 
subspaces is irreducible.) 

5. Ambiguity Functions 

This section uses the earlier results on the cj-representation theory of 
R^ to establish several facts about ambiguity functions. 
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5.1 Ambiguity Functions of the Hermites 



Let Pnm{t,f) = {'^n, P{tj)'^m) denote the ambiguity functions asso- 
ciated with the Hermite waveforms v„ defined in (46). This section 
outlines how a closed form expression for pnm{t, f) can be obtained. 

The Hermite waveforms are simpler in the Bargmann-Segal space, 
so the first step is to notice that Pnm{t,f) = (jn, because the 
isometry maps v„ to j^; see Section 4.6 for notation. Next, write 
down the generating function 



G{t,f;a,b) 






(Jn,/5ftj)(jm)) 

n,m=0 



\/ nl ml 



(82) 



where the second equality follows from the fact that e^j, defined implicitly 
by (58), is given explicitly by 







Using the reproducing kernel property (58) shows that 



(83) 



G{t,f-,a,b) = p^^^j,^(eb)(a) 

— :^(i+27i-*/)+7f*/tgfe(a+(t-27ri/)/U2) 



(84) 



Equating coefficients of in (82) and (84) results in an explicit ex- 
pression for Pnm{t,f) = (jn, Moreover, making the substitu- 
tion (t, /) = (r cos 9, r sin 9) allows this expression to be written in terms 
of the Laguerre polynomials. See (Wilcox, 1991) or (Miller, 1991) for 
details. 



5.2 Symmetries of Ambiguity Functions 

This section introduces the basic machinery for studying the possi- 
ble symmetries of an ambiguity function. Although only the magnitude 
\A{t, f)\ of an ambiguity function A{t, f) is of interest in general, it 
is significantly simpler to study symmetries of A(t, f) rather than of 
\A{t, f)\. Changing multipliers changes the phase of the ambiguity func- 
tion, thereby potentially altering the symmetry. It is therefore important 
to note that this section chooses to work with the symmetric ambiguity 
function defined in Section 4.4. 

Recall from Section 4.9 that the elements of SL(2,R) preserve the 
multiplier u. Moreover, from (81), it follows that if AG){t,f) is an 
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ambiguity function then so too is (a(t, /)) for any a G SL(2,R). 
Indeed, 

= (u, [/(«)■ V(ij)t^(a)v) (85) 

^ f)- 

It follows that if 5 is a subgroup of SL(2, R) and the waveform u satisfies 
U{a)u = u for all a G 5 then the ambiguity function gIu'^u is symmetric 
with respect to S. In fact, since it is shown in (Miller, 1991) that = 
only if u = Au' for some A G T, we say that u has an 
5-symmetric ambiguity function if and only if there exists a function 
A : 5 ^ T such that U{a)u = A(a)u for all a G 5. 

One interesting subgroup of SL(2, R) is the rotation group. However, 
it is more natural to rotate a dilated version of the ambiguity function 
so that the units of time and frequency are compatible. Let S denote 
the group whose elements S{6) are dilated rotations, namely 



5(0) = 



^ ^ ^ 
0 V Sin0 

COS0 — 27rsin0\ 

^ sin 9 cos 0 J ’ 



— sin 6\ 
cos 9 J 



0 



9 G [0,27t). 



( 86 ) 



Note that 5 is a subgroup of SL(2,R) and that S{—6) is the inverse 
of 5(0). It is now shown that the only waveforms with rotationally 
symmetric (that is, 5-symmetric) ambiguity functions are the Hermites. 

Anticipating the involvement of the Hermites, we choose to work in 
Bargmann-Segal space. Define U^^{a) = 'Lot/(a)o'k“^ so that = 

t/^®(a). Here, 'k is the isomorphism defined in (60) and 
jg i/-representation analogue of (61), namely 




= e 



-^{2-Kif-t) 



a 




t + 27ri/\ 



(87) 



It is claimed 

t/^^(5(0))(a)(z) =a(e*^z). (88) 

Indeed, S{9){t, f) = (t cos 0 — 2nf sin 0, (27r)“^f sin 0 + / cos 0) , hence 




V \/2 

= U^^{S{9))-^pl^j^^U^^{S{9)){s.){z), 




( 89 ) 
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as required. The rotationally invariant waveforms in the Bargmann- 
Segal domain are thus those waveforms a(z) which satisfy 

V0 G [0,27t), a(e*®z) = \{9)a{z) (90) 

for some A mapping 6 into T. The only solutions are the Hermite wave- 
forms a{z) = in(z), and scalar multiples of them, for n = 0, 1, • • • . 

It is remarked that if the ordinary rotations R{6) are considered in- 
stead of the dilated rotations S{6), then U[R{9)) turns out to be the 
fractional Fourier transform. Indeed, we know from (53) that U [R{9)) is 
the Fourier transform if 0 = — 7t/2. Thus, U[R{9)^ embeds the Fourier 
transform in a one parameter group. 

5.3 Multiple Waveforms 

This section presents an intriguing observation about how an ambi- 
guity function hypothetically can be made to resemble the ideal delta 
function, thus overcoming the Heisenberg uncertainty principle. 

Assume it was somehow possible to transmit simultaneously but sep- 
arately many different waveforms and to receive the returns 

separately too. What we have in mind is not using different portions of 
the electromagnetic spectrum, which does not separate the waveforms 
in the strict sense we require, but rather to imagine the hypothetical 
situation of multiple universes where the target exists in each one but a 
different waveform can be used to detect it in each universe. 

If the waveforms are orthogonal and have equal energy with total en- 
ergy one, so that ||ujj|p = N~^, then it follows from Moyal’s identity 
that the ambiguity functions Au„,u„(i, /) are orthogonal, and in partic- 
ular, the ambiguity function A{t, f) = ^u„,u„(T /) of th® whole 

system has norm 

N N 

l|.4(«,/)f = ||u„||* = (91) 

n=l n=l 

This shows that there is less volume under the ambiguity surface for a 
given total energy as N increases. Therefore, since yl(0,0) = 1, the total 
ambiguity function can be made to approach the ideal delta function as 
N approaches infinity. 

6. The Wide Band Case 

Our description of the Doppler effect is actually only an approximation 
that works when the radial velocities of moving objects in the scene are 
much smaller than the speed of light, and the transmitted signal has 
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a spectrum in a narrow band around its carrier frequency. This is, of 
course, a valid assumption on most circumstances in radar. However, in 
sonar, which works on similar principles, it can often be the case that 
objects move at a non-trivial proportion of the speed of sound in water 
(around 1500 metres per second). Here it is appropriate to replace the 
“narrow band approximation” by the so-called “wide band theory”. As 
the name implies, this theory is also appropriate when the spectrum 
of the transmit signal is broad, as is the case, for example, where the 
transmit signal is a very short pulse. Such signals are not typical in 
conventional radar systems, but again are in sonar, where the pulse is 
often created by a small explosive charge. 

The effect of Doppler is not, as we have suggested in (2), a shift 
in frequency, but is a dilation of the signal. Thus the return from an 
object moving at a radial velocity (towards the transmitter /receiver) of 
V is, leaving aside the magnitude term. 



ret(t) = w ' at — 



2r 

c + V 



where c is the speed of the wave and 

(1-f) 



a = 



(1 + f) 

This leads, after a rescaling, to an ambiguity function of the form 



(92) 



(93) 



Wu,v(r, a) = \/a / u(t)v(a(t -|- r)) dt. 



(94) 



The relevant representation in the wide band case is the representation 



P(a,b)(v)(t) = ^faw{at + h), v G L^(R), 
of the affine group 

: a>0, 6 gr| 



(95) 



(96) 



with group multiplication given by matrix multiplication. Note that 
P(a,b) is actually a representation on the right, meaning = Pg2Pgi 

for all gi,P 2 G A. Analogous to the narrow band case, the wide band 
ambiguity function can be expressed (after a change of coordinates) as 

Wu,v(a,6) = (u,p(a^b)v) (97) 

and its properties are studied by investigating the representation P(a,b) 
of the affine group. 
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This investigation is not undertaken here for reasons of space. The 
interested reader is, however, referred to (Miller, 1991) for details. It is 
remarked though that the representation theory of the affine group is 
complicated by the affine group not being unimodular. That is to say, 
whereas the left invariant Haar measure defined in (13) of the Heisenberg 
group is also the right invariant measure of the Heisenberg group, the 
left and right invariant measures of the affine group differ. 
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Abstract Clifford’s geometric algebra, in particular the algebra of physical space 
(APS), lubricates the paradigm shifts from the Newtonian worldview 
to the post-Newtonian theories of relativity and qnantnm mechanics. 
APS is an algebra of vectors in physical space, and its linear subspaces 
include a 4-dimensional space of paravectors (scalars plus vectors). The 
metric of the latter has the pseudo-Euclidean form of Minkowski space- 
time, with which APS facilitates the transition from Newtonian me- 
chanics to relativity without the need of tensors or matrices. APS also 
provides tools, such as spinors and projectors, for solving classical prob- 
lems and for smoothing the transition to quantum theory. This lecture 
concentrates on paravectors and applications to relativity and electro- 
magnetic waves. A following lecture will extend the treatment to the 
quantum/classical interface. 



Keywords: algebra of physical space, Clifford algebra, instruction in the quantum 
age, paravectors, quantum/classical interface, relativity. 

Introduction 

Devices such as single-electron transistors and switches, single photon 
masers, and quantum computers are hot topics. Not only is it difficult to 
ignore quantum and relativistic effects in such devices; these effects may 
often be dominant. Indeed, the theory of electromagnetic phenomena is 
inherently relativistic, and impetus behind quantum computation is to 
employ quantum superposition to make a new breed of computer vastly 
superior to current ones for certain tasks. 

Relativity and quantum theory were introduced roughly a century 
ago. They both entail paradigm shifts from the assumptions made by 
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Newton, but our teaching has not fully adapted. Undergraduate in- 
struction still begins with Newtonian mechanics, and only after New- 
ton’s worldview is ingrained does it progress to the more difficult and 
abstract mathematics of quantum theory and relativity. In this lecture, I 
suggest that much of the apparent dichotomy between the mathematics 
of Newtonian mechanics and that of both quantum theory and relativity 
arises because we have not been using the best formulation to describe 
the physics. 

Clifford’s geometric algebra, in particular the algebra of physical space 
(APS), empowers classical physics with geometric tools that lead to a 
covariant formulation of relativity and are strikingly similar to tools 
common in quantum theory. [1, 2] With APS, quantum theory and rel- 
ativity can be taught with the same mathematics as Newtonian physics, 
and this permits an earlier, smoother introduction to post-Newtonian 
physics. This, in turn, encourages students to build intuition consistent 
with relativistic and quantum phenomena and properly prepares them 
for the quantum age of the 21st century. 

The principal purpose of this lecture is to demonstrate that the struc- 
ture and geometry of APS make it a natural and minimal model for 
both Newtonian and relativistic mechanics. It is natural in that it as- 
sociates quantities much in the same way that humans usually do, and 
it is minimal in that it avoids the assumption of additional structure 
that is not relevant to the physics. I start by reviewing the Clifford 
algebras commonly used in physics and their relation to APS, which I 
introduce as an algebra of spatial vectors. However, we quickly note 
that APS contains a 4-dimensional paravector space with the metric 
of spacetime, and it can be used to formulate a covariant approach to 
relativity. Multiparavectors and their Lorentz transformations are also 
discussed and interpreted. The relation of APS to the spacetime algebra 
(STA) is discussed in detail with an emphasis on the difference between 
absolute and relative formulations of relativity. The quantum-like tools 
of eigenspinors and projectors are introduced, along with applications to 
the electrodynamics of Maxwell and Lorentz, including a study of Stokes 
parameters and light polarization. 

1. Clifford Algebras in Physics 

The importance of Clifford algebras to physics and engineering is in- 
creasingly recognized, [3, 4, 5, 6, 7, 8] and most physicists have en- 
countered Clifford algebras in some guise. The three most commonly 
employed in physics are the quaternion algebra H =C?o, 2 , the algebra of 
physical space (APS) Cis, and the spacetime algebra (STA) C?i ,3 . They 
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are closely related. We use the common notation [9] for the Clifford 
algebra of a vector space of metric signature (p, q ) , and Cip = Clp^ . 

When Hamilton introduced vectors in 1843, they were part of an 
algebra of quaternions. [10] The superiority of H for matrix-free and 
coordinate-free computations of rotations in physical space has been re- 
cently rediscovered by space programs, the computer-games industry, 
and robotics engineering. Furthermore, H has been investigated as a 
replacement of the complex field in an extension of Dirac theory. [11] 
Quaternions were used by Maxwell and Tait to express Maxwell’s equa- 
tions of electromagnetism in compact form, and they motivated Clifford 
to find generalizations based on Grassmann theory. 

Hamilton’s biquaternions (complex quaternions) are isomorphic to 
APS: BI(8)C ~ C?3 , familiar to physicists as the algebra of the Pauli spin 
matrices. The even subalgebra C£^ is isomorphic to H over the reals, and 
the correspondences i ^ 0362, j ^ e^e3, k e2ei identify pure quater- 
nions with bivectors in APS. APS distinguishes cleanly between vectors 
and bivectors, in contrast to most approaches with complex quaternions. 
The identification of the volume element 616263 = i (see next section) 
endows the unit imaginary with geometrical significance and helps ex- 
plain the widespread use of complex numbers in physics. [12] The sign 
of i is reversed under parity inversion, and imaginary scalars and vectors 
correspond to pseudoscalars and pseudovectors, respectively. 

APS is also isomorphic to the even part of STA: Ci^ ~ C?i~3- STA is 
familiar as the algebra of Dirac’s gamma matrices, where each matrix 
7^, ^ = 0, 1,2,3, represents a unit vector in spacetime. To be sure, 
Dirac’s electron theory (1928) was based on a matrix representation 
of C?i,3 over the eomplex field, whereas STA, pioneered by Hestenes 
[13, 14, 15] for use in many areas of physics, is C?i,3 over the reals. 

Clifford algebras of higher-dimensional spaces have also been used 
in robotics [16], many-electron systems, and elementary-particle theory 
[17]. This lecture focuses on APS, although generalizations to C£n are 
made where convenient, and one section is devoted to the relation of 
APS to STA. A full study of APS is beyond the scope of this lecture and 
can be found elsewhere [6] , but the algebra is sufficiently simple that we 
can easily present its foundation and structure. 

1.1 APS: an Algebra of Vectors 

To form any algebra, we need elements and an associative product 
among them. The elements of APS are the vectors of physical space 
u, V, w, and all their products uv, uvw, uu, .... If we start with vectors 
in an n-dimensional Euclidean space, then only one axiom is needed to 
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define the algebraic product: the square of any vector u is its square 
length (a real number, a scalar): 



uu = u^= u • u . (1) 

That’s it. This axiom, together with the usual rules for adding and 
multiplying square matrices, determines the entire algebra. 

Let’s put u = V + w. The axiom implies that 

vw + wv = 2v • w . (2) 

Evidently the algebra is not commutative. If v and w are perpendicular, 
they anticommute. Let {ei, e2, • • •} be a basis of orthogonal unit vectors 
in the n-dimensional Euclidean space. Then = 1 and eie2 = 

We can be sure that eie2 doesn’t vanish because it squares to — 1 : 
01626162 = — 1 . The product of perpendicular vectors is a new element 
called a bivector. It represents a directed area in the plane of the vectors. 
The “direction” corresponds to circulation in the plane: if the circulation 
is reversed, the sign of the bi vector is reversed. The bivector replaces the 
vector cross product of polar vectors, but unlike the usual cross product, 
it is intrinsic to the plane and can be applied to planes in spaces of more 
than 3 dimensions. 

Bivectors can also be viewed as operators on vectors. They generate 
rotations and reflections in the plane. To rotate any vector u = u^e\ + 
v?e2 in the 6162 plane by a right angle, multiply it by the unit bivector 
6162 : U6162 = v}e2 — v?ei. The counterclockwise sense of the rotation 
when u is multiplied from the right corresponds to the circulation used 
to define the “direction” of 6162. Multiplication from the left reverses 
the rotation. To rotate u in the plane by an arbitrary angle 4 > multiply 
it by a linear combination of 1 (no rotation) and 6162: 

u (cos 4 > + 6162 sin (/)) = u exp (6i62(/>) 

Note the exponential function of the bivector 6i620. It can be defined 
by its power-series expansion because all powers of bivectors can be 
calculated in the algebra. The Euler-type relation for exp (6i62</>) follows 
from the fact that 6162 squares to —1. 

A general vector v, with components both in the plane and perpen- 
dicular to it, is rotated by the angle (p in the 6162 plane by 



V 



R^rR^ 



( 3 ) 
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where the rotors R, are 

R = exp (-eie2</>/2) = cos - - eie2 sin - ( 4 ) 

t r \\ 4> 4> 

W = cos - - (eie2j ' sin - = cos - - e2ei sin - = i? . ( 5 ) 



The dagger f denotes a conjugation called reversion,^ in which the order 
of vectors in products is reversed. Thus, for any vectors v, w,, (vw)^ = 
wv. The reversion of other elements, say AB, can then be found from 
the rule (AB)^ = BIaI. An element equal to its reversion is said to 
be real, whereas one equal to minus its reversion is imaginary. The 
two-sided spinorial form of ( 3 ) preserves the reality of the transformed 
vector. From ( 5 ), R is unitary and consequently all products of vectors 
transform in the same way ( 3 ). In particular, the bivector eie2 commutes 
with the rotors exp (±eie2(/>/2) and is therefore invariant under rotations 
in the eie2 plane. It is equally well expressed as the ordered product of 
any pair of orthonormal vectors in the plane. 

The trivector 616263 squares to —1 and commutes with all vectors that 
are linear combinations of 61,62, and 63. More generally, products of k 
orthonormal basis vectors Bj can be reduced if two of them are the same, 
but if they are all distinct, their product is a basis /c- vector. In an n- 
dimensional space, the algebra contains (^) such linearly independent k- 
vectors, and any real linear combination of them is said to be an element 
of grade k . Thus, scalars have grade 0 , vectors grade 1 , bivectors grade 
2 , trivectors grade 3 , and so on. 

In APS, where the number of dimensions is n = 3 , the 616263 is the 
highest-grade element, namely the volume element, and it commutes 
with every vector and hence with all elements. It can be identified with 
the unit imaginary: 

616263 = i . (6) 

Note that i changes sign under spatial inversion: ey — 6^, k = 1 , 2 , 3 . 

Imaginary scalars are called pseudosealars because of this sign change. 
Any bivector can be expressed as an imaginary vector, called a pseu- 
doveetor. For example. 



6162=6162(616263)^ = ^63. ( 7 ) 

The center of APS (the part that commutes with all elements) is spanned 
by { 1 , i} and is identified with the complex field. Every element of APS 



tilde “is often used to indicate reversal, but in spaces of definite metric such as Euclidean 
spaces, the dagger is common since it corresponds to Hermitian conjugation in any matrix 
representation in which the matrices representing the basis vectors are Hermitian. 
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is a linear combination of 1 , ei, 62 , 63 , iei, ie 2 , ^ 63 , i over the reals or, 
equivalently, of 1, ei, 02, 63 over the complex field. The element *x = —ix 
is said to be the Clifford- Hodge dualoi x. 

1.2 Existence 

We should check that our algebra exists. It is possible to define struc- 
tures that are not self-consistent. The existence of a matrix represen- 
tation is sufficient to prove existence. The canonical one replaces unit 
vectors by Pauli spin matrices. There are an infinite number of valid 
representations. They share the same algebra and that is all that mat- 
ters. 

2. Paravector Space as Spacetime 

APS includes not only the 3-dimensional linear space of physical vec- 
tors, but also other linear spaces, in particular the 4-dimensional linear 
space of scalars plus vectors. In mathematical terms, this 4-dimensional 
linear space is also a vector space and its elements are vectors, but to 
distinguish them from spatial vectors, we call them paraveetors. A par- 
avector p can generally be written 



P = + P (8) 

where p^ is a scalar and p = k = 1,2,3, a physical vector (the 

summation convention for repeated indices is used). It is convenient to 
put eo = 1 so that we can write 

P = P%, P = 0,l,2,3. (9) 

The metrie of the 4-dimensional paravector space is determined by the 
quadratic form (“square length”) of paraveetors. Since is not generally 
a pure scalar, we need the Clifford eonjugate of p, p = = p^ — p 

since the product pp = pp = (p^) — is always a scalar. If we take pp 
to be the quadratic form of p and use 

(3^)5 = ^ + ^) = (^)s ( 10 ) 

to denote the scalarlike (that is scalar plus pseudoscalar) part of any 
element x, then 

pp = {pp)g = p^^p^'g^^ ( 11 ) 

and the Minkowski metrie of spacetime = (e^§i,)^ arises automat- 
ically. Spacetime can be viewed as paravector space, and spacetime 
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vectors are (real) paravectors in APS. The vector part of the paravector 
is the usual spatial vector, and the scalar part is the time component. 



Example 1 In units with the speed of light c = 1, the quadratie form 
(or “square length”) of the spaeetime displaeement dr = dt + dr is 

dr dr = dt^ — = dt^ (l — v^) , 



where v is the veloeity dr/dt. If we define the dimensionless sealar known 
as the proper time r by dr dr = , then we see that ^dr = dt with 




[ 1 -v^] 



- 1/2 



In partieular, dr = dt in a rest frame of the displaeement. The dimen- 
sionless proper veloeity is 




dt 

dr 




7(1 + v) , 



and by definition it is unimodular: uu = 1. Other spaeetime veetors 
ean be similarly represented as paraveetors in APS. For example, the 
energy-momentum paraveetor of a partiele is p = mu = E + p . 



In his article [18] of 1905 on special relativity, Einstein did not mention 
a spacetime continuum. That was a construction proposed three years 
later by Minkowski. I like to think that Einstein, had he seen it, would 
have appreciated the natural appearance of the spacetime geometry in 
paravector space. 



2.1 Multiparavectors 

From the quadratic form of the sum p + g of paravectors, we obtain 
an expression for the sealar produet of paravectors p and q : 

{pq)s = \ip^ + (ip) ■ ( 12 ) 

One says that paravectors p and q are orthogonal if and only if (pq) 5 = 0 . 
The veetorlike (vector plus pseudovector) part of pq is 

{pq)v ^\{pq- qp) =pq- {pq)s (13) 

and represents the directed plane in paravector space that contains par- 
avectors p and q. It is called a biparaveetor and can be expanded in a 
basis of unit biparavectors {e^eu)y with = ±1 : 

{pq)v =P^q"' (e^e^)y 
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The biparavectors form a six-dimensional linear subspace of APS equal 
to the direct sum of vector and bivector spaces. Since bivectors of APS 
are also pseudovectors, any biparavector is also a complex vector. 

Biparavectors arise most frequently in APS as operators on paravec- 
tors. Thus, the unit biparavector {e^e^)y , fi u, rotates any paravector 
p = ae^ -|- be^, in the plane to an orthogonal direction: 

V P — 2 bGfj^Gi/Giy QjGpG^G^ bGpG^Gp) 

— dTj^^Gy A' bT]i,i,G^. 

In analogy with bivectors, biparavectors generate rotations in paravec- 
tor space. One of the most important biparavectors in physics is the 
electromagnetic field (or “Faraday”) which in SI units with c = 1 can 
be written F = (^dA'jy = E -|- iB , where d = G^d/dx^ is the paravector 
gradient operator and A = (/> -|- A is the paravector potential. We will 
see below how to define F in terms of rotation rate. 

Triparavectors can also be formed. The triparavector subspace of APS 
is four-dimensional, the direct sum of pseudovector and pseudoscalar 
spaces. The volume element of paravector space is the same as that of 
the underlying vector space: 

60616263 = 616263 = i . ( 14 ) 

As seen before, it commutes with all elements of APS. 

2.2 Paravector Rotations and Lorentz 
Transformations 

Rotations and reflections in paravector space preserve the scalar prod- 
uct {pq) s of any two paravectors. Paravector rotations have the same 
spinorial form as vector rotations, 

p LpL\ ( 15 ) 

where L is a unimodular element {LL = 1 ) known as a Lorentz rotor. 
Lorentz rotations are the physical Lorentz transformations of relativ- 
ity: boosts, spatial rotations, and their products. In APS they can be 
calculated algebraically without matrices or tensors. 

Lorentz rotors for spatial rotations are just the same rotors ( 4 ) intro- 
duced above, and those for boosts are similar except that the rotation 
plane in paravector space includes the time axis 6 q. For a boost along 
61 for example, the Lorentz rotor L has the real form 

^ f - w\ , re ■ ^ w rt 

L = exp (^6160— j = cosh — 61 smh — = L' 



(16) 
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where w is a scalar parameter called the rapidity. The Lorentz rota- 
tion ( 15 ) can be calculated directly with two algebraic products, but 
because it is linear in p, it is sufficient to determine the transformed 
basis paravectors 

( 17 ) 

Since the proper velocity in the rest frame is unity, the Lorentz rotation 
of eo = 1 must give the proper velocity u induced by the boost on objects 
initially at rest: 

u = LbqL^ = LL^ = 7 (1 -|- uei) ( 18 ) 

7 = cosh re, -jv = sinhre . ( 19 ) 

Since ei also commutes with the biparavector eieg of the rotation plane 
whereas 62 and 63 anticommute with it, the boost of the paravector basis 
elements gives 

uq = ueo, ui = uei (20) 

U2 = e2, U3 = 63 . 

The set {uq, u^, U2, U3} = {u^} is an orthonormal basis of paravectors 
for the boosted system. It follows that the boost of any paravector 
p = p^Gfj, produces 

LpL^ = p>"vL^ = u (p°eo p^ei) -h p^e2 + p^e^ (21) 

= 7 + vp^) eo + 7 {p^ + vp^) ei -h p^e2 + . 

We can eliminate the dependence on the paravector basis by introducing 
components of p coplanar with the rotation plane 

P^ = P°eo -h 



and perpendicular to it 

p^ = p^02 -I- P^63 = P - P^. 

Then the boost of p is 

p LpL^ = up^ +P'^- (22) 

No matrices or tensors are required, and the algebra is trivial. Algebraic 
calculation of the boost is sufficiently simple to be taught at an early 
stage of a student’s study of physics. Students can perform most calcula- 
tions in introductory relativity texts using no more than the transforma- 
tion ( 22 ) and the basic axiom ( 2 ) of the algebra. A couple of examples 
will illustrate the simplicity of the approach. 
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Example 2 Let Carol have proper veloeity ubc os seen by Bob, and 
let Bob have proper veloeity uab with respeet to Aliee. To find Carol’s 
proper veloeity uac with respeet to Aliee, we boost the paraveetor p = 
Ubc by L = as in (22). If the veetor parts of uab o.nd ubc 

eollinear, uab md ubc tie in the same spaeetime plane and eommute. 
Then = 0 and the transformation (22) reduees to the produet 



uac = uabubc ■ 



By writing eaeh proper veloeity in the form u = 7(1 +v) , one easily 
extraets the usual result for eollinear veloeity eomposition^ 

{uAc)v VAB+Vsc 

VAC = 7 ^ = 7— • 

\UAc)s ^ + ^AB-^BC 

Example 3 Consider the ehange in the wave paraveetor k of a pho- 
ton when its souree is boosted from rest to proper veloeity u. The wave 
paraveetor, like the momentum hk, is null, kk = 0 , and ean be written 
A: = (j + k = w^l + k^ , where the unit veetor k gives the direetion and 

Lv the frequeney of k. From (22), writing out the eoplanar part of k as 
k'^ = Lv kll and noting that k~^ is a veetor, we find 

A: = a; + k^ ^ k' = LkL'^ = u + k^'^ + k"*- 

with « = 7 (1 + v) and k^ = k • v v = k — k"*", where v is the unit vee- 
tor along v. This transformation deseribes what happens to the photon 
momentum when the light souree is boosted. Evidently k-*- is unehanged, 
but there is a Doppler shift in lo and a ehange in k • v : 



uj' = + kl^^ ^ = 7 CU + k • 

k' • V = /nv ("w + k^^') \ = 7 CU ("n + k • v') = uj ' cos O' . 



The ratio k' • v/w shows how the photons are thrown forward 



cos a = 



V + cos 6 
\ V cos 9 
in what is ealled the “headlight” effeet. 



Example 4 The results of the previous example ean be eombined with a 
qualitative deseription of Thomson seattering to explain how high-energy 



^Note that the cleanest way to compose general Lorentz rotations is to multiply rotors: 
La^C — ^AB^BC - See also the section below on eigenspinors. 
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gamma-ray photons are produced near sources of energetic electrons. In 
Thomson scattering, a electron initially at rest scatters an unpolarized 
beam of radiation into all directions. It is the limit of Compton scatter- 
ing when to m, the rest energy of the electron. In the Lab frame, the 
electrons are ultra-relativistic with energies 7m » m, and they collide 
with photons of the 2.7 K blackbody radiation that permeates space. Let 
(jJo be Lab frequency of the background radiation. In the rest frame of 
the electron, this gets Doppler shifted to roughly 'ywo (within a factor 
between 0 and 2 , depending on angle), and Thomson scattering occurs, 
redistributing the photons in all directions. Transforming the scattered 
photons back to the Lab gives a collimated beam of photons in the di- 
rection of the electron velocity with energies of order 'y'^ujQ. Thus, 5 GeV 
electrons (7 = can raise 10 “^ eV photons to MeV energies. 



An attractive feature of this simple algebraic approach to introductory 
relativity is that it is not restricted to such simple cases. Rather it is part 
of an algebra that simplifies computations for all relativistic phenomena. 
Lorentz rotations of multiparavectors, in particular, are readily found 
by putting together those for paravectors. Thus, for the boost along 
6i considered above, the biparavectors eido and 6263 are seen to be 
invariant whereas 6162, 6163, eo62, eQ§3 are multiplied from the left by u. 
More generally, the paravector product pq transforms to LpL\LqL^) = 
LpqL. From this we can confirm that scalar products ( 12 ) of paravectors 
are Lorentz invariant and that any biparavector, say F, transforms as 

F ^ LFL . ( 23 ) 

The power of APS allows us to generalize expression (22) to an arbi- 
trary Lorentz rotation. We start with an arbitrary simple Lorentz rota- 
tion, that is, a rotation in a single paravector plane. Simple rotations in- 
clude all spatial rotations, pure boosts, and many boost-rotation combi- 
nations. We again split p into one component p^ coplanar with the rota- 
tion plane plus another component p^ perpendicular to it: p = p^ -\-p~^. 
Consider a biparavector {uv)y , which represents the plane containing 
the independent paravectors u and v. By expansion it is easy to see that 



{uv)yU = 



uvu — vuu 



= u 



vu — uv 



u (uv) 



t 

V ■ 



Similarly, one finds {uv)yV = v {uv)\y . The component p^ coplanar 
with {uv)y is a linear combination of u and v and therefore obeys the 
same relation 



{uv)yP^ = P^ {uv)\, . 
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On the other hand, the component is orthogonal to u and v, = 

0 = and similarly for u replaced by v. Consequently, 



_ _L UVp^-VUp^ _L , _,t 

{uv)vP = = -p {uv)'y . 

It follows that if L is any Lorentz rotation in the {uv)y plane, then 



P 



A 



V' = Lp^, p-^L^ = Lp^ , 



and the rotation (15) reduces to 

p Lp^L^ + Lp^L^ = L^p'^ + p^ . (24) 

Expression (22) is a special case of this relation for a pure boost. We can 
now generalize the result further to include compound Lorentz rotations. 
Any rotor can be expressed as 

L = ±expQw^, (25) 

and the arbitrary biparavector W can always be expanded as a sum of 
simple biparavectors W = Wi + W2 = (1 + ia) Wi, where a is a real 
scalar. The relation W 2 = idW \ means that W 2 is proportional to 
the dual of Wi so that Wi and W 2 represent orthogonal planes: every 
paravector in Wi is orthogonal to every paravector in W 2 . Since Wi 
and W 2 commute, L can be written as a product of commuting simple 
Lorentz rotations: 



L = L1L2, Li = L2 = 

Every paravector p can be split into one component, pi, coplanar with 
Wi and another, p 2 , coplanar with W 2 . Two applications of the simple 
transformation result (24) gives its generalization 

p LpL^ = L1L2 {pi + P 2 ) l\l\ = L\pi + LIp2 . (26) 

Note that the Lorentz transformations of paravectors p and are the 
same, as are those of biparavectors F and F. However, p and p have 
distinct transformations as do F and Ft. As a consequence, Lorentz 
transformations mix time and space components of p, but not its real and 
imaginary parts, whereas such transformations of F leave F vector like 
but mix vector (real) and bivector (imaginary) parts. 
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3. Interpretation 

Note that in APS, the Lorentz rotation acts directly on paravectors 
and their products, and not on scalar coefficients. Contrast this with 
matrix or tensor formulations where transformations are given only for 
the coefficients. Of course, the two approaches are easily related. For 
example, in the boost (21) of p, if p and the transformed p' = LpL^ are 
expanded in the same basis {e^} , we can simply read off the transfor- 
mation of coefficients: 

= 7 (p° + vp^) 

p'^ = 7 (p^ + vp^) 

12 2/3 3 

P =P , P =P , 

and this is easily cast into standard matrix form. The two approaches are 
therefore equivalent, but the APS approach of transforming paravectors 
is more geometric and does not require Lorentz transformations that 
are aligned with basis elements. Indeed, the algebraic transformation 
p ^ p' = LpL^ is independent of basis. 

A basis is needed only to compare measured coefficients. In general, 
the transformed paravector p' is p' = p'^e^ = p^Uj^ , where as above 
= Le^L^ are the transformed basis paravectors. To isolate individual 
coefficients from an expansion, we introduce reciprocal basis paravectors 
defined by the scalar products 

(e^e^)5 = > 

where d)) is the usual Kronecker delta. The reciprocal paravectors of the 
standard basis elements are = eo,e^ = — e^, k = 1,2,3. With 
their help, we obtain 

p'^ = p" (u,e^)^ = p" (Le.Lte^)^ = T^pT (27) 

3.1 Active, Passive, and Relative 
Transformations 

It is common to distinguish active transformations from passive ones. 
In active transformations, a single observer compares objects in differ- 
ent inertial frames, whereas in passive transformations a single object 
is observed by inertial observers in relative motion to each other. The 
transformations described above were active. However, APS accommo- 
dates both active and passive interpretations. The mathematics is the 
same, as seen from expression (27) for the transformation elements 

TO = ( (Le,Lt) = (e, (Lte^L) = (e^Le^Lt))^ . (28) 
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Here we used the property that the scalar part of a product is inde- 
pendent of the order {xy)g = {yx)g for any elements x,y of APS. The 
first equality of (28) finds the components of the transformed basis par- 
avectors on the original basis, whereas the last finds components of the 
original basis on the inversely transformed basis. These are alternative 
ways to interpret the same expression. 

To see the relation between active and passive transformations more 
explicitly, consider the passive transformation of a fixed paravector from 
one observer, say Alice, to another, say Bob. Let pA = be the 

paravector as seen by Alice in terms of Alice’s standard (rest) basis. 
Bob, who moves at proper velocity u with respect to Alice, will measure 
a different paravector, namely pB = respect to his rest frame. 

Note that the paravector basis in expansions of both pA and pB is the 
same, namely the standard basis {e^} , even though pA is expressed 
relative to Alice and pB relative to Bob. The reason is that the standard 
basis {e^} is relative] it is at rest relative to the observer. To relate pA 
and pb, both must be expressed relative to the same observer. Bob’s 
frame as seen by Alice is 

{u^} = {Le^Lt} (29) 

with uq = u. Thus pA can be written 

PA = Pa^im = Pb^u = LpbL^ , (30) 

which can be inverted to give the passive transformation 

PA ^ PB = LpaL'^ ■ (31) 

This has the same form as the active transformation, but with the inverse 
Lorentz rotor. 

APS uses the same mathematics to find not only passive and active 
Lorentz rotations, but also any mixture of passive and active rotations: 
all that counts is the relative Lorentz rotation of the observed object with 
respect to the observer. This property means that the basis paravectors 
themselves represent not an absolute frame, but rather a frame relative 
to the observer (or Lab). The proper basis {e^} with eo = 1 represents 
a frame at rest with respect to the observer. In APS, as in experiments, 
it is only the relative motion and orientation of the observed object with 
respect to the observer that is significant. 

3.2 Covariant Elements and Invariant Properties 

Experiments generally measure real scalars such as the size of paravec- 
tor components on a given basis. The most meaningful geometric quan- 
tities in relativity, however, are spacetime vectors and products thereof 
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that simply rotate and reflect in paravector space under the action of 
Lorentz transformations. Such quantities are said to be covariant. In 
APS, covariant spacetime vectors are real paravectors, and the bipar- 
avectors and triparavectors formed from them are also covariant, but 
one can move back and forth easily between covariant quantities and 
their components. Individual components are not generally covariant. 
Some properties, such as the scalar product of covariant paravectors and 
the square of simple covariant biparavectors, are invariant, unchanged 
by Lorentz rotations. Such properties are known as Lorentz scalars. 

3.3 Relation of APS to STA 

An alternative to the paravector model of spacetime is the spacetime 
algebra (STA) introduced by David Hestenes [ 13 , 15 ]. APS and STA are 
closely related, and it is the purpose of this section show how. 

STA is the geometric algebra of Minkowski spacetime. Whereas 
the Minkowski spacetime metric appears automatically in APS, it is 
imposed in STA. In each frame STA starts with a 4 -dimensional or- 
thonormal basis {70,71)72,73} = {7/^1 satisfying 

Inlv + Ivin = 

where as previously, are elements of the metric tensor = 

diag ( 1 , — 1 , — 1 , — 1 ) . The volume element in STA is I = 70717273 • Al- 
though it is referred to as the unit pseudoscalar and squares to —1, it 
anticommutes with vectors, thus behaving more like an additional spa- 
tial dimension than a scalar. 

The frame chosen for a system can be independent of the observer 
and her frame {7^} . Any spacetime vector p = expanded in the 
system frame {7^} can be multiplied by observer’s time axis 70 to give 

ino=P-%+P^%= P^^n ) ( 32 ) 

where^ = 7^70 . In particular, uq = 7070 is the proper velocity of 
the system frame {7^} with respect to the observer frame {7^}. If the 
system frame is at rest with respect to the observer, then 70 = 70 and 



double arrow might be thought more appropriate than an equality here, because and 
7 m? 70 act in different algebras. However, we are identifying Cis with the even subalgebra 
of C^i, 3 , so that the one algebra is embedded in the other. Caution is still needed to avoid 
statements such as 

wrong! „ ^ ^ ^ ^ 

I = 616263 = ei A 62 A 63 = 7 i A 70 A 72 A 70 A 73 A 70 = 0 . 

This is not valid because the wedge products on either side of the third equality refer to 
different algebras and are not equivalent. 
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the relative basis paravectors are replaced by proper basis elements 
e^. The result 

P7o = = pO 

where eo = 1 and = 7^70, is called a space/time split. The asso- 
ciation 7^70 = when 70 = 70 establishes the previously mentioned 
isomorphism between the even subalgebra of STA and APS. Together 
with = 'Jii% for a more general system basis, it emphasizes that the 
basis vectors in APS are relative', they always relate two frames, the 
system frame and the observer frame, each of which has its own basis in 
STA. 

Clifford conjugation in APS corresponds to reversion in STA, indi- 
cated by a tilde. For example, = (7/^70) = 7o7/i • In particular, 

the proper velocity of the observer frame {7;^} with respect to 70 is 
uq = 7o7o ) the inverse of uq = 70% • It is not possible to make all of the 
basis vectors in any STA frame Hermitian, but one usually takes 7J = 70 
and 7^ = — % in the observer’s frame {7;^} . More generally, Hermitian 
conjugation of an arbitrary element T in STA combines reversion with 
reflection in the observer’s time axis 70: Pt = 7oP7o • For example, the 
relation 

u), = 70 (7 m7o) 70 = = U;. 

shows that all the paravector basis vectors are Hermitian. It is im- 
portant to note that Hermitian conjugation is frame dependent in STA 
just as Clifford conjugation of paravectors is in APS. 

Example 5 The Lorentz- invariant sealar part of the paraveetor produet 
pq in APS has the same expansion as in STA: 

{Pd)s = 

= iltilololu + 7 i ^ 7 o % 7 m ) 

Basis biparaveetors in APS beeome basis biveetors in STA: 

^ (e^e^ - e^e^) = ^ (7^^7071^ - lulololfi) 

= ^ - Ivin) ■ 

Lorentz transformations in STA are effected by 7^ ^ A7^L, with 
LL = 1 . Every product of basis vectors transforms the same way. An 
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active transformation keeps the observer frame fixed and transforms only 
the system frame. Suppose the system frame is related to the observer 
frame by the Lorentz rotor L : 7^ = . Then 

= Aplo = ( 70 -^ 70 ) = 

which coincides with the relation in APS. In a passive transformation, 
it is the system frame that stays the same and the observer’s frame that 
changes. Let us suppose that the observer moves from frame {7^} to 
frame {7^} where 7^ = L^fj,L. Then 

Gf, = 7/^70 ^ . 

To re-express the transformed relative coordinates in terms of the 
original , we must expand the system frame vectors 7^ in terms of the 
observer’s transformed basis vectors 7^. Thus 

= L7^L7o = 

The mathematics is identical to that for the active transformation, but 
the interpretation is different. It is important to stress that the system 
and observer frames are distinct in STA, and under active or passive 
transformations, only one of them changes. Confusion about this point 
can easily lead to errors in the transformation properties of elements and 
their relation space/time splits. 

Since the transformations can be realized by changing the observer 
frame and keeping the system frame constant, the physical objects can 
be taken to be fixed in STA, giving what is sometimes referred to as an 
invariant formulation of relativity. Note, however, that the name “in- 
variant” for covariant objects is consistent only if no active Lorentz trans- 
formations are needed. In order to avoid inconsistency and to prevent 
confusion of covariant expressions with Lorentz scalars such as scalar 
products of spacetime vectors, I prefer to call STA an absolute-frame 
formulation of relativity. 

We have seen that a Lorentz rotation has the same physical effect 
whether we rotate the object forward or the observer backward or some 
combination. This is trivially incorporated in APS where only the ob- 
ject frame relative to the observer enters. The absolute frames of STA, 
while sometimes convenient, impose an added structure not required by 
experiment. 

The space/time split of a property in APS is simply a result of ex- 
panding into vector grades in the observer’s proper basis {e^}: 

p = -\-p 

F = E -F iB . 
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Although p and F are covariant, the split is not; it is valid only in a 
rest frame of the observer. To relate this to the split as seen in a frame 
moving with proper velocity u with respect to the observer, we expand 
p in the paravector basis {u^ = and F in the corresponding 

biparavector basis {(u^Uiy)yj , where u = uq. The passive transforma- 
tion of the observer to the moving frame replaces by e^, and the 
new space/time split simply separates the result into vector grades. The 
physical fields, momenta, etc. are transformed and are not invariant in 
APS but covariant, that is, under a Lorentz transformation, the form of 
equations relating them remains the same even though the vectors and 
multivectors themselves change.^ 

STA uses the metric of signature (1,3) , but the pseudoEuclidean met- 
ric of signature (3, 1) could equally well have been used. This alterna- 
tive uses the real Clifford algebra C? 3 q in place of STA’s Because 

the two algebras are inequivalent, there is no simple transformation be- 
tween them, and some authors have debated the relative merits of the 
two choices. APS easily accommodates both possibilities. Our formu- 
lation above gives a paravector metric of signature (1,3) , but simply 
by changing the overall sign on the definition of the quadratic form (or, 
equivalently, of the Clifford conjugate), we obtain a paravector space of 
the other signature, namely (3, 1) . 

STA and APS seem equally adept at modeling relativistic phenomena. 
This at first is surprising since STA has 2^ = 16 linearly independent 
elements whereas APS has only half that many. To understand how APS 
achieves its compactness, note that Lorentz scalars are grade 0 objects 
in both STA and APS, but spacetime vectors in STA are homogeneous 
elements of grade 1 whereas in APS they are paravectors, which mix 
grades 0 and 1. APS maintains a formal grade distinction between an 
observer’s proper time axis and spatial directions by making time a 
scalar and spatial direction a vector. Furthermore, spacetime planes are 
represented by elements of grade 2 in STA and by biparavectors, which 
combine elements of grades 1 and 2, in APS. Elements of a given grade 
evidently play a double role in APS. Rather than being a disadvantage, 
however, the double roles mirror common usage. 

Por example, the spacetime momentum of a particle at rest has only 
one nonvanishing component, namely a time component equal to its 
mass (c = 1), but the mass is also a Lorentz scalar giving the invariant 
“length” of the momentum. In APS, the mass is simply a scalar that 
can fill both roles. In STA, the two roles are represented by expressions 



"^You can have absolute frames in APS, if you want them for use in passive transformations, 
by introducing an absolute observer. 
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of different grades: the Lorentz-invariant mass is a scalar while the rest- 
frame momentum is a vector. They are not equal but are instead related 
by a space/time split, which requires multiplication by 70 . 

Another example is provided by the electromagnetic field F, which 
for a given observer reduces to the electric field E if there is no magnetic 
part. In APS, F is a biparavector with a vector part E and a bivector 
part icB for any given observer. There is no problem in identifying E 
both as a spatial vector and a spacetime plane that includes the time 
axis. In STA, on the other hand, the two choices require different nota- 
tion. Since F is an element of grade 2 , we either specify E as F -70 or as 
this times 70. The expression F -70 has the form of a spacetime vector in 
STA, but of course E transforms differently. The correct transformation 
behavior of F-yo is obtained in STA if, as discussed above, one distin- 
guishes between the observer frame {7^} and the system frame {7^} and 
applies the Lorentz rotation to only one of them. 

The double-role playing of vector grades in APS is responsible for 
its efficiency in modeling spacetime. STA requires twice as many de- 
grees of freedom to model the same phenomena. This appears to be the 
cost of having an absolute-frame formulation of relativity. Both STA 
and APS easily relate a covariant representation to observer-dependent 
measurements, although the connection is more direct in APS. 

4. Eigenspinors 

The motion of a particle is described by the special Lorentz rotor 
L = A that transforms the particle from rest to the lab. Any property 
known in the rest frame can be transformed to the lab by A, which is 
known as the eigenspinor of the particle and is generally a function of 
its proper time r. For example, the spacetime momentum in the lab is 
p = AmAb The term “spinor” refers to the form of a Lorentz rotation 
of A, namely 

A ^ LA, ( 33 ) 

which is the form for the composition of Lorentz rotations but is distinct 
from Lorentz rotations of paravectors and their products. 

The eigenspinor A (r) is the solution of a time evolution equation in 
the simple linear form 

f-A=if!A(r). (34) 

where Li is the spacetime rotation rate of the particle in the lab. This 
approach offers new tools for classical physics. It implies that 



p = AmA'I' -|- Amk^ = (Lip)jjj . 



( 35 ) 
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For the motion of a charge e in the electromagnetic field F = E + zB, 
the identification 

Ct = — F, (36) 

m 

when substituted into (35), gives the covariant Lorentz-force equation. 
Note that (36) can be taken as a covariant definition of F, valid in any 
inertial frame. It is trivial to find A for any uniform field F: 

A(r) = exp(^^^ A(0). (37) 

Solutions can also be found for relativistic charge motion in plane waves, 
plane- wave pulses, or plane waves superimposed on static longitudinal 
electric or magnetic fields. [19] 

5. Maxwell’s Equation 

Maxwell’s famous equations are inherently relativistic, and it is a 
shame that so many texts treat much of electrodynamics in nonrelativis- 
tic approximation. One reason given for introducing relativity only late 
in an electrodynamics course is that tensors and or matrices are required 
which makes the presentation more abstract and harder to interpret. In 
APS we can easily display and exploit relativistic symmetries in simple 
vector and paravector terms without the need of tensors or matrices. 

Maxwell’s equations were written as a single quaternionic equation by 
Conway (1911), Silberstein (1912), and others. In APS we can write 

5F = noj , (38) 

where /zq = eg ^ = dvr x 30 Ohm is the impedance of the vacuum, with 
3 = 2.99792458. The usual four equations are simply the four vector 
grades of this relation, extracted as the real and imaginary, scalar like and 
vectorlike parts. It is also seen as the necessary covariant extension of 
Coulomb’s law V-E = p/eq. The covariant field is not E but F = E-|-iB, 
the divergence is part of the covariant gradient 5, and p must be part of 
j = p — j ■ The combination is Maxwell’s equation.^ 

It is a simple exercise to derive the continuity equation {dj)g = 0 
in one step from Maxwell’s equation. One need only note that the 
D’Alembertian dd is a scalar operator and that {F)g = 0. 



®We have assumed that the source is a real paravector current and that there are no con- 
tributing pseudoparavector currents. Known currents are of the real paravector type, and 
a pseudoparavector current would behave counter-intuitively under parity inversion. Our 
assumption is supported experimentally by the apparent lack of magnetic monopoles. 
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5.1 Directed Plane Waves 



In source-free space (j = 0), there are solutions F (s) that depend 
on spacetime position only through the Lorentz invariant s = {kx) g = 
wt — k • X, where fc = u;-|-k/0isa constant propagation paravector. 
Since d {kx)g = k, Maxwell’s equation gives 

BF = fcF' (s) = 0 . (39) 



In a division algebra, we could divide by k and conclude that F' (s) = 0, 
a rather uninteresting solution. There is another possibility here because 
APS is not a division algebra: k may have no inverse. Then k has the 
form /c = u; -|- k^ , and after integrating (39) from some sq at which 

F is presumed to vanish, we get fl — k')F(s) = 0, which means F (s) = 



kF (s) . The scalar part of F vanishes and consequently ^kF(s)y = 

k • F (s) = 0 so that the fields E and B are perpendicular to k and 
thus anticommute with it. Furthermore, equating imaginary parts gives 
iB = kE and it follows that 



F = E + iB = (^1 + k) E (s) 



(40) 



with E = (F)sjj real. This is a plane- wave solution with F constant on 
spatial planes perpendicular to k. Such planes propagate at the speed 
of light along k. In spacetime, F is constant on the light cone k • x = t. 
However, F is not necessarily monochromatic, since E (s) can have any 
functional form, including a pulse, and the scale factor ui, although it 
has dimensions of frequency, may have nothing to do with any physi- 
cal oscillation. The structure of the plane wave F is that of a simple 
biparavector representing the spacetime plane containing both the null 
paravector 1 -|- k and the orthogonal direction E. This structure ensures 
that F itself is null: F^ = 0 . In fact, F is what Penrose calls a null 
flag. The flagpole 1 -|- k lies on the light cone in the plane of the flag 
but is orthogonal to both itself and the flag. This is the basis of an 
important symmetry that is critical for determining charge dynamics in 
plane waves. [19] The null-flag structure is beautiful and powerful, but 
you miss it entirely if you only write out separate electric and magnetic 
fields. The electric and magnetic fields are simply components of the null 
flag; E gives the extent of the flag perpendicular to the flagpole, and B 
represents the spatial plane swept out by E as it propagates along k. 

The electric field E (s) determines the polarization of the wave. If 
the direction of E is constant, for example E (s) = E (0) cos s, the wave 
is linearly polarized along E (0) . If E rotates around k, for example 
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E (s) = E (0) exp , k = ±1, the wave is circularly polarized with 

helicity k. Note that the flagpole can gobble the unit vector k : 

F (s) = + k^ E (0) exp + k^ exp f/tks^ E (0) 

= + k^ E (0) exp (— zKs) . (41) 

This establishes an equivalence for null flags between rotations about the 
spatial direction k of the flagpole and multiplication by a phase factor. 
Since the dual of F is — iF, the phase factor is said to induce duality 
rotations. The energy density £ (eoE^ + B^/^o) and the Poynting 
vector S = E X B//U 0 for the plane wave are combined in 

^eoFFt =£ + S = eoE^ (l + k) • 

5.2 Polarization Basis 

The application of APS to the polarization of a beam of light or 
other electromagnetic radiation gives a formulation vastly simpler than 
the usual approach with Mueller matrices, and it demonstrates spinorial 
type transformations and additional uses of paravectors. Furthermore, 
the mathematics is the same as used to describe electron polarization, 
which I discuss in my next lecture. 

The field F of a beam of monochromatic radiation can be expressed as 
a linear combination of two independent polarization types. Both linear 
and circular polarization bases are common, but a circular basis is most 
convenient, partially because of the relation noted above between spatial 
and duality rotations. Circularly polarized waves also have the simple 
form used popularly by R. P. Feynman [20] to discuss light propagation 
in terms of a rotating pointer that we can take to be E(s). A linear 
combination of both helicities of a directed plane wave gives 

F= (l + kj Eoe*'^*" 

= (l + k) Eoe-*^ (E+e-*^ + E_E*) , 



where E± are the real field amplitudes, 5 gives the rotation of E about 
k at s = 0 from the unit vector Eq, and in the second line, we gobbled 
k’s. Because every directed plane wave can be expressed in the form 
F = A + k') E (s) , it is sufficient to determine the electric field E (s) = 
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(F)« : 






E — ( ( 1 + k ) Eo-E+e 



—iS —i 



1 + k ) Eo^+e“*^ + Eo ( 1 + k ) E_e 



+ 1 + k EoE_e 



A5 



—iS 



= ((e+,e-)^e , 



where the complex polarization basis vectors e± 

null flags satisfying e- = e\_, ■ e\_ = 1 = e- 

spinor 



^ = V2 




= 2 ± k^ Eq are 

• el, and the Poincare 

(42) 



gives the (real) electric-field amplitudes and their phases, and it contains 
all the information needed to determine the polarization and intensity 
of the wave. The spinor (42) is related by unitary transformation to the 
Jones vector, which uses a basis of linear polarization. 



5.2.1 Stokes Parameters. To describe partially polarized light, 
we can use the coherency density, [6] which in the case of a single 
Poincare spinor is 

p = eo4>4>'^ = p'^cTf, , (43) 

where the are the Pauli spin matrices. The normalization factor 
£q has been chosen to make the time-averaged energy density. The 
coefficients p^ = (pcr^)^ are the Stokes parameters. The coherency den- 
sity p = p^ + p is a paravector in the algebra for the space spanned by 
{<Ti, <T2, 0-3} . This space, called Stokes space, is a 3-D Euclidean space 
analogous to physical space. It is not physical space, but its geometric 
algebra is isomorphic to APS, and it illustrates how Clifford algebras 
can arise in physics for spaces other than physical space. 

As defined for a single <h, p is null (det p = pp = 0) and can be written 
p = p^ {1 + n) , where n, a unit vector in the direction of p, specifies the 
type of polarization. In particular, for positive helicity light, n = <73, 
for negative helicity polarization n = — <73, and for linear polarization 
at an angle J = c/>/2 with respect to Eq, n = <7i cos <j) + cr 2 sin (f. Other 
directions correspond to elliptical polarization. 



5.2.2 Polarizers and Phase Shifters. The action of ideal 
polarizers and phase shifters on the wave is modeled mathematically 
by transformations on the Poincare spinor of the form ^ T$. For 
polarizers T that pass polarization of type n, T is the projector T = Pn = 
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^ (1 + n) , which is proportional to the pure state of that polarization. 
Projectors are idempotent (P„ =Pn); just as we would expect for ideal 
polarizers since a second application of Pn changes nothing further. The 
polarizer represented by the complementary projector P„ annihilates Pn: 
PnPn = P-nPn = 0, and in general, opposite directions in Stokes space 
correspond to orthogonal polarizations. 

If the wave is split into orthogonal polarization components (±n) and 
the two components are given a relative phase shift of a, the result is 
equivalent to rotating p by a about n in Stokes subspace: T = + 

Pne~*“/^ = Depending on n, this operator can represent both the 

Faraday effect and the effect of a birefringent medium with polarization 
types n and — n corresponding to the slow and fast axes, respectively. 

5.2.3 Coherent Superpositions and Incoherent Mixtures. 

A superposition of two waves of the same frequency is coherent because 
their relative phase is fixed. Mathematically, one adds spinors in such 
cases: + <^2 , where the subscripts refer to the two waves, not 

to spinor components. Waves of different frequencies have a continually 
changing relative phase, and when averaged over periods large relative 
to their beat period, combine incoherently: p = p\ + P 2 - The degree 
of polarization is given by the length of p relative to po and can vary 
from 0 to 100%. Any transformation T of spinors, ^ Td>, transforms 
the coherency density by p ^ T pT\ and transformations that do not 
preserve the polarization can also be applied to p. [6] 

6. Conclusions 

The multiparavector structure of APS makes the algebra ideal for 
modeling relativistic phenomena. It presents a covariant formulation 
based on relative motion and orientation but provides a simple path to 
the spatial vectors for any given observer as well as to the operators 
that act on the vectors. The formulation is simple enough to be used 
in introductory physics courses, and it holds the promise of becoming 
a key factor in any curriculum revision designed to train students to 
contribute significantly to the quantum age of the 21st century. 

Since APS is isomorphic to complex quaternions, any calculation in 
APS can be repeated with quaternions taken over the complex field. 
However, the geometry is considerably clearer in APS. In its descriptive 
and computational power for relativistic physics, APS seems as capable 
as STA. However, STA has twice the size of APS in order to add the 
non-observable structure of absolute frames to its formulation. 
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Abstract In this paper we develop a new unified approach to the so-called gener- 
alized Fourier- Clifford- Prometheus sequences, transforms (FCPTs) and 
M-channel Filter Banks. It is based on a new generalized FCPT-gen- 
erating construction. This construction has a rich algebraic structure 
that supports a wide range of fast algorithms. 

Keywords: Clifford algebra, filter banks, Golay-Shapiro sequences, Fourier-Clifford- 
Prometheus transforms. 



1. Introduction 

The basis which has come to be known as the Prometheus Orthonor- 
mal Set (PONS) was introduced in [1] to prove the H.S. Shapiro global 
uncertainty principle conjecture. Each function in PONS is called a 
Golay-Shapiro sequenee. They are defined on [0,1], piecewise ±1 and 
can change sign only at points of the form j/2”, j = 0, 1, . . . , 2”^ — 1, 
n = 1,2,.... These basis functions satisfy almost all standard proper- 
ties of the Walsh functions. Discrete classical Fourier-Prometheus Trans- 
forms (FPT) in bases of different Golay-Shapiro sequences can be used in 
many signal processing applications: multiresolution by discrete orthog- 
onal wavelet decomposition, digital audio, digital video broadcasting, 
communication systems (Orthogonal Frequency Division Multiplexing, 
Multi-Code Code-Division Multiple Access), radar, and cryptographic 
systems. 

Golay-Shapiro (GS) 2-complementary (±l)-valued sequences associ- 
ated with the cyclic group Z2 were introduced by Shapiro and Golay 
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in 1949-1951 [2]-[7]. In 1961, Golay [3] gave an explicit construction 
for binary Golay complementary pairs of length 2 ”^ and later noted [4] 
that the construction implies the existence of at least 2 ”^m !/2 binary 
Golay sequences of this length. They are known to exist for all lengths 
N = 2“10^26'>', where a,/3 , 7 are integers and a,/3 , 7 > 0 [ 8 ], but do 
not exist for any length N having a prime factor congruent to 3 modulo 
4 [9]. Budisin [10] using the earlier work of Sivaswamy [11], gave a 
more general recursive construction for Golay complementary pairs and 
showed that the set of all binary Golay complementary pairs of length 2 ™ 
obtainable from it concides with those given explicitly by Golay [3] . For 
a survey of results on nonbinary Golay Gomplementary pairs, see [ 12 ]- 
[13]. Recently, Davis and Jedwab [14], combining results appearing in 
the work of Golay and Shapiro cited above, gave an explicit description 
of a large class of Golay complementary sequences in terms of certain 
cosets of the first order Reed-Muller codes. The following general ele- 
ments are used for building the classical Fourier-Prometheus transforms 
in bases of classical Golay-Shapiro sequences: 1) the Abelian group Z 2 , 
2) the 2-point Fourier transform .T 2 , and 3) the complex field C; i.e., 
these transforms are associated with the triple (Z 2 ,.T 2 ,C). 

The multiresolution analysis (MRA) operates upon a discrete signal 
x{l) of length 2”, where n is an integer. The sequence x{l) is convolved 
with two filters L and H. Each convolution results in a sequence half the 
length of the original sequence. The result from the convolution with 
the low-pass filter is again transformed. Each re-transformed sequence 
of the low-pass output is referred to as a dilation. Eor a sequence x{l) of 
length 2”, a maximum of n dilations can be performed. MRA applied 
to a real-valued sequence x{l) is defined recursively by the equations: 

c^P\l) = L = H , 

where p = n, n — 1 , . . . , 1 , 0 , c^{l) = x{l), and 

2 ’"-! 

c{l) = {Lx){l) = Y, kip{l-2k)x{l), 

1=0 

2 "-l 

d{l) = {Hx){l) = Y khp{l - 2k)x{l) 

1=0 

are low-pass and high-pass filters, respectively. 

The sequences c^\l) and d^P\l) are called the “averages” and “dif- 
ferences” of the original signal. The inverse discrete wavelet transform 
reconstructs = x{l) using the recursive algorithm 

c(p+^){l) = L*{c^P\l)] + H*{d^P\l)], 
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where L* and H* are the inverse filters of L and H, respectively. All fil- 
ters L, H, and L* , H* , satisfy the following equation LL* = I, HH* = 

1, and 

LL* + HH* = 21, LH* = H*L = 0, (1) 

where I and 0 denote the identity and zero operators. Note that a pair of 
filters having these properties required of the transformations L and H 
are known as quadrature mirror filters, having the perfect reconstruction 
property. 

The conditions (1) can be rewritten in terms of the .^-transform as 

\kip{z)\‘^ -h \khp{z)\‘^ = 2, kip{z)kip{-z) + khp{z)khp{-z) = 0, Vz G Ti 

where Ti is the unit circle of the complex field C. These conditions mean 
that impulse responses kip{l) and khp{l) form a Golay-Shapiro (GS) 2- 
complementary pair. 

In this paper we develop a new unified approach to the so-called 
generalized Fourier- Clifford- Prometheus (FGP) sequenees, FGP trans- 
forms (FGPTs), and M-ehannel Filter Banks. We describe the precise 
theoretical and computational relationship between M-band wavelets, 
M-channel filterbanks and generalized Golay-Shapiro sequences. The 
approach is based on a new generalized FGPT-generating construction. 
This construction has a rich algebraic structure that supports a wide 
range of fast algorithms. This construction is associated not with the 
triple ^Z 2 , .7-2,0^, but rather with other groups instead of Z 2 , other 

unitary transforms instead of J- 2 -, and other algebras (Glifford algebras) 
instead of the complex field C. 

2. New construction of classical and 
multiparametric Prometheus transforms 

We begin by describing the original Golay 2-complement ary (±1)- 
valued sequences. 

Definition 1 Let p(t) := (po,Pi, • • • q(i) := (go, gi, • • • , 

where Pi,qi G {±1}- The sequenees p{t),q{t) are ealled a 2-eomplemen- 
tary {±l)-valued or Golay eomplementary pair over {±1} if 

GOR[p,p](r) -hGOR[q,q](r) = N5{t), 

or 

|p(^)P + |q(2:)P = iV, VzGTi, 

where GOR[f, f](r) is the periodie eorrelation funetion of f(t); p(z) and 
q{z) are Z -transforms ofp{t) andq{t), respeetively. Any sequenee whieh 
is a member of a Golay eomplementary pair is ealled a Golay sequenee. 




392 



COMPUTATIONAL NONCOMMUTATIVE ALGEBRA 



The Fourier-Prometheus matrix of depth n has size 2"' x 2” : IFV 2 ^ = 
[Pr„(t)]^ Q. For a and t we shall use binary representations a = a^n] ■= 
(err , ^ 2 ? • • • ; ^n) ? ^ ^[A\ * (^ 1 T 2 ; * • • ; ^n) ? where cr^ G (^0,1}, i 

1, 2, . . . , n. Obviously, a[r] = (cri), erp] = (cri, 02 ), ct[ 3 ] = (cri; ct 2 , era); • • • 
*[i] = ih), t[ 2 ] = {ti,t 2 ), t[n] = {h,t 2 , ■ . .,tn), .... For this reason, 

Pl'(0,0,...,0,0)(^l) ■ ■ ■ itn) 

Pr(0,0,...,0,l)(^l; ■ ■ ■ lin) 

[n-l]=0 _^^(«[n-l]d)(^)_ 

■ ■ ■ itn) 






P^(0,0,...,1,0)V^1) • • • ) ^n) 
PJ^(0,0,...,1,1)(^1; ■ ■ ■ itn) 



where Pr(^j^_y o)(^) and Pr(„j^_^j r)(^) are a pair of GS 2-complementary 
sequences and ffl represents the vertical concatenation of matrices. 

The classical matrix IFV 2 ^ is formed by starting with the (2 x 2)- 



matrix TV 



Pro(i) 

Pri(t) 



1 1 
1 -1 



and by repeated application 



of the PONS-iteration construction to pairs of rows in the matrix. 



In the (n -|- l)st iteration this construction takes each pair 



P 

q 



■ Pr(a[„_y,o)(i) 
. PrO[„-i],i)W 



of 



2" TT-p 

' («[„_!], Qn) 



O[„_l]=0 L P’-(«[n-l]d)(^) _ 



and constructs four rows of twice the length 
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where are dyadic shifts. Using this construction for all 2 

complementary pairs (a[fc_ 2 j = 0, 1, . . . , 2^“^ — 1), we obtain 



fc -2 



nn+l 






= m2"-i 



(«[„],«„+!) = ^ [^2 



“[ n ]=0 



“[ n ]=0 



^2 






-1) 


, we 


p 


- 


*- 


q . 


in - 1 ] 





(2) 



Repetition of this construction yields the Fourier-Prometheus matrix 
of size 2^+1 X 2^+b 

Our new PONS construction uses in (2) three parametric unitary 
matrices 



U2{P,(p,u) 



g*(d+7) cos (f l) sin 

g-*(/t-7) sin {p — cos 



instead of T 2 '■ 



2"+l 



TV 



ff[n] )fin+l) (Z^'^+l 5 Pn-\-l ; b^+1 ) 



□ 2"-l 

^ar„i=0 



'n+1 ) V^n+l ) 7n+l ) 



PRaf„_ll,0)(i|/l,<^n,7n) 






Pr(a[„_l],l)(^l/3n,<^n,7n) _ 



-‘2 



( 3 ) 



where 



/^n+l — (/3l) • • • ) /3n+l)) V^n+l — • • • ; Pn-\-l)i Tn+1 — (T1) • • • ) 7n+l) 

are three (n + 1)D vectors of parameters. Extra parameters 
{k = l,2,...,n + l) are changed from stage to stage in this construc- 
tion. The resulting matrix still has orthogonal rows and every pair is 
2-complement ary in the Golay-Shapiro sense. 



3. PONS associated with Abelian groups 
3.1 Abelian groups 

A natural generalization of a 2-complement ary Golay pair is an N- 
complementary Golay N-member orthogonal set of Glifford-valued se- 
quences po{t ), ..., pAT_i(t), where t = 0, 1, ..., — 1. 

Definition 2 Let po(t), pi(t), . . . , pat-i(^) be an N-member orthogo- 
nal set of Clifford-valued sequenees, where Pi{t) G 
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g 27 ru/TV g ^ Clifford algebra, and u is an appropriate bivee- 

tor with the property = — 1 . The sequenees {Pi(i)}^o^ ealled 
N-eomplementary -valued sequenees of length if 

COR[po, PoRt") + . . . + COR[pat-i, pat-iRt) = N^ 5 {t), 

or |po(^)R + |pi(^)R + • • • + |pat-i(2;)R = N^, Vz G Ti, where pRz) 
are Z -transforms of pi{t), i = 0 , 1 , . . . , N — 1 , respeetively. 



Let, for example, N = 3 . Then for the group Z3 we define the Fourier- 
Clifford-Prometheus transform as the Fourier-Clifford transform 



:= 



Pro(i) 




'll 1 ■ 


Pri(t) 


= 


1 63 ei 


_ Pr2(t) _ 




_ 1 e| 63 _ 



For the group Z| we define the Fourier-Clifford-Prometheus transform 
using the classical PONS-construction (2) by 



TCV. 



a\ ,ct2 




’ Pro(i) 








Pri(t) 








Pr 2 (t) _ 



rpai 

-'a 



1 1 1 
1 £3 
1 6| 63 








'll 1 


1 63 6| 


1 6| 63 




1 1 1 
1 63 6| 
1 6^ 63 




1 el 63 


1 1 1 


1 63 6| 






1 1 1 

1 63 el 
1^3 ^ 3 . 


1 63 6§ 


1 6§ 63 


1 1 1 



■ 1 


1 


1 


1 


£3 




1 




£3 




' Pl'(0,0)W ‘ 


1 


1 


1 


£3 




1 




£3 


1 




Pl'(0,l)W 


1 


1 


1 




1 


£3 


£3 


1 






Pl'(0.2)(i) 


1 


^3 


£3 


1 


1 


1 


1 


£3 






Pl'(l,0)W 


4 


£3 


1 


1 


1 


1 


£3 




1 


= 


Pr ( i , i )( t ) 


£3 


1 


^3 


1 


1 


1 




1 


£3 




P ^( i , 2 )( i ) 


1 


£3 




1 


^3 


£3 


1 


1 


1 




Pr ( 2 , o )( i ) 


£3 




1 


£3 


£3 


1 


1 


1 


1 




P ^( 2 , i )( t ) 


. ^3 


1 


£3 


£3 


1 


£3 


1 


1 


1 




. P ^( 2 , 2 )( i ) . 



where are 3 -cyclic shift operators. After n + 1 iterations we 

obtain the following Fourier-Clifford-Prometheus transform on the group 
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The same expression is true for the Fourier-Clifford-Prometheus trans- 
form on the group : 




gjvW-l (jrc^ 

O [„]=0 ^ 




where TCpj is the Fourier-Clifford transform on the group Z^r, 



r rpcti \N— 1 
Sai=0 

are A^-cyclic shift operators. 

3.2 Abelian groups Zjvi © Zyva 0 ■ • • © Zat„ 

Let Zjvj © Zat 2 © . . . © Zjv^ be an Abelian group, where , A"2 , • • • , 
are positive integers. The classical Fourier-Prometheus transforms are 
generated by the Fourier- Walsh transform T2 and by dyadic shifts. Fou- 
rier-Clifford-Prometheus transforms associated with Z^ are generated 
by the Fourier-Clifford transform of the group Zjv and by N-ary 
shifts. We shall generate new Fourier-Clifford-Prometheus transforms 
associated with Abelian groups Z^r^ © Z^Tj © . . . © Zat„ © Zat„^j by using 
the set of Fourier-Clifford transforms iFC N i , jvj , . . . , ■, n„+i ■ 

For example, the group Z2 © Z3 © Z4 requires three Fourier-Clifford 
transforms 




^3 



1 £3 ei , 
_ 1 £3 _ 



^4 



'll 1 1 ■ 

1 ei e \ £4 

1 £| 1 e \ 

1 © 1 

1 



1 1 
1 -1 
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Let us consider the group Z2 © Z3. IFCV2 = -^^2 



Pro(t) 

Pri(t) 



^ ^ . We define the Fourier-ClifFord-Prometheus transform asso- 

ciated with the Abelian group Z2 © Z3 by using the classical PONS 
construction 



‘^‘^TCV 



(01,02) 



= I 



Pp(oi,0))2W 






\ 




PP(01,1))2W 










Pr{(oi, 2 )} 2 (i) . 


) 



where ((ai,/32))2 := («i,/?2) mod 2 . Therefore, 



TCV2-Z 



■f 1 1 

1 63 e| 
1 e| 63 






■ 1 1 


1 -1 


1 1 




1 1 1 






1 1 






1 63 6| 








1 -1 




1 6| 63 _ 




1 -1 







1 


1 


1 


-1 


1 


1 ■ 




’ Ppo,o)(i) 


1 


1 


£3 


£3 








Pl'(0,l)W 


1 


1 


£3 


-£3 








Ppo,2)(i) 


1 


-1 


1 


1 


1 


-1 




Ppl,0)(i) 


£3 


-ei 


1 


1 


£3 


-£3 




Pr(i,i)W 


£3 


-£3 


1 


1 








. Pr(i,2)(i) 



We design Fourier-ClifFord-Prometheus transforms associated with 
the Abelian groups Z^r^ © Z^Tj © . . . © Z by the same classical PONS 
construction 



7 Y[n+l] 



TCV 



(0[„]0„ + l) 





( 


TD-p 

^^(of„_n,(0}jv„) 






\ 


gArW_i 

O[„]=0 


^Cn„+i 








n~'0'n 

^N„+i 


\ 






Pr(Q,(n _ 


/ 



where is the Fourier-ClifFord transform on the group ’Z^n„+i , (cin + 

(5n+i)n+i := (an + /3n+i) mod Nn+i-, «[„] := (ai,a2, . . . ,an), := 

N 1 N 2 • --Nn, («[„], /3n+i) := (ai, • • • ,an,/3n+i), and, hence. 



((«[„], /3n+i))n := (ai, • • . ,an,/3n+i) mod 
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4. Fast Fourier-Prometheus Transforms 
4.1 Radix- 2 Fast Transforms 

Let us return to the Fourier-Clifford-Prometheus transform 



r pr(o,o)(i) 1 r 1 1 1 -1 1 




= AoIl4(:F2 (g) JF2 )Aq, (5) 

where Aq := diag(Pro(t)) = diag(PrQ,(0)) is a diagonal matrix and 
Il 4 is a special permutation matrix. From this expression we see that 
Prometheus functions up to constant factor are modulated Walsh func- 
tions: 

where (— g,nd (— are the so-called Shapiro multipliers, and 
Wal(Q,^^Q,j)(ti, t 2 ) = (— same result is true in the general 
case for the Fourier-Clifford-Prometheus (2” x 2'^)-transform = 

^ 2 ® ■ • •C>.7^2)Ao, where II 2 " is a special permutation matrix 
and Aq = diag(Pro(t)) = diag(PrQ,(0)) is the diagonal matrix whose diag- 
onal elements form the Shapiro (±l)-multipliers. If a = (ai, « 2 , • • • > On) 
is the binary representation of the number in the ath row of Aq, where 
a* G Z 2 , then for diagonal elements Aq,^q we have the expression Aq^q = 
(— l)Ei=i moi+i_ quantity h{a) = a^ai+i is the number of 

occurrences of the block B = (11) in the binary representation of a, 
(ai, « 2 , • • • ) On)- For this reason the Fourier-Clifford-Prometheus trans- 
form has the Cooley- Tukey fast algorithm 

^p2» = Aon2» [ CT^„ ■ ■ ■ CT^n 1 Ao , (6) 

where := I 2 0 ... 0 ip 2 'Si ■■■ 0 h for i = 1, 2, . . . , n are the so-called 
Cooley- Tukey sparse matriees. 

Now we can prove that an analogous result is true for Davis-Jedwab 
Clifford- valued sequences. Let MC 2 h = {e 2 '>}fc=o^ P® multiplicative 
cyclic group of 2^th roots of unity and £ 2^1 bo ^ 2^th primitive root in a 
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Clifford algebra Cla. Let (ci, C 2 , . . . , Cn) G = Z 2 h © Z 2 h © ... © Z 2 h, 
be an nD vector of parameters over Z 2 h, where is a set of nD vectors 
(labels). Let 

TC 2 {ell)-.= \] Sl = fl fc = l, 2 ,...,n, 

L J ^ 2^1 

(7) 

be a set of (2 x 2)-matrices. Then the tensor product of these matrices 
:= AoHan (.©2(e2i)®-7"2(e2i)©- • •©^2(e2i0) Aq (8) 

gives us new multi-parametric Fourier-Prometheus transforms with fast 
Cooley- Tukey algorithm: 

= Aon 2 » [CTl{e‘:^,)CTl{e^,) • • • C'T 2 T(e^^)] Aq, 

(9) 

where CT^nie^^u) '■= h® ® . . . ® h , and k = 1,2, . . . ,n. 

4.2 Radix-N Fourier-Prometheus transforms 

Let us consider the case of Z|. In this case IFCV ^2 = Aong(.T 3 ©.T 3 )Ao, 
where Aq = diag{Pr(o,o)(iiT 2 )} = diag{Pr(ij^i 2 )( 0 ) 0 )} and Ilg is a spe- 
cial permutation matrix. From this expression we see that Prometheus 
functions up to constant factor are modulated Chrestenson-Clifford se- 
quences (i.e., Clifford-valued characters of the group Z|): 

Pr’(ai,a2)(^i>^2) = Pr(„j^„2)(0>0) Ch(„j^„2)(^i>^2) • Pr(o,o)(ti,i2) 

_Ctl «2 r'U (+ + _ai «2 _ 0 lil© 202 t 2 © 1*2 

— £3 Q,2)vll©2j£3 J —£3 |^^3 ■ £3 J) 

where 

PrK,«©( 0 , 0 )=e“i“^ 

Pl'( 0 , 0 )(il,^ 2 ) = £ 3 *", 

and 

For this reason, this Fourier-Clifford Prometheus transform has the Coo- 
ley- Tukey fast algorithm !FV ^2 = AoIIg Crg^ • CT^ Aq, where CT^ := 

•T's © ^ 3 , CTg = Is © ©" 3 . The same result is true in the general case for 
Fourier-Clifford-Prometheus (3^^ x 3^^) -transforms 

= AoIl 3 n ^©"3 ©©"3 © • • • ©©" 3 ^ Aq = AoIl 3 n CT^nCT^n ■ ■ ■ CT^n Aq, 
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where CT^n := Is ( 8 > • • • < 8 > ^3 < 8 > • . . < 8 > /s for i = 1,2, . . . ,n are the so-called 
Cooley- Tukey sparse matriees. Now we are ready to write the analogous 
expression for Fourier-ClifFord-Prometheus {N^ x -transforms 

TT ’ Afn = AoIIjV" < 8 > < 8 > • • • < 8 > Aq 

= Aonjv" [CT^„CT^„ • • • Aq, (lo) 

where CTf^„ := I^r (g) . . . (g) F]\f (g) . . . (g) /jv for i = 1,2, ... ,n are the 
so-called Cooley-Tukey sparse matriees. The same result is true in the 
general case for the Fourier-ClifFord-Prometheus x N["^l)-transForm 

FV atW = AoIlArr. {Fn^ <g) Fn 2 ® ® FNr^j^O 

= AoIlArn Aq, (11) 

where Aq := diag(Pro(t)) = diag(PrQ,(0)) is a diagonal matrix and IlArn 
a permutation matrix, and C'T^[„] := /jvj (g) . . . (g) FNi 0 ■ ■ ■ 0 In„ for 
z = 1, 2, . . . , n are the so-called Cooley-Tukey sparse matriees. 

5. Conclusions 

We have shown how ClifFord algebras can be used to Formulate a new 
unified approach to so-called generalized Fourier- Clifford- Prometheus 
transforms. It is based on a new generalized FCPT-generating con- 
struction. This construction has a rich algebraic structure that supports 
a wide range of fast algorithms. This construction is associated not with 
the triple , F 2 , , but rather with other groups instead of Zlf, other 

unitary transforms instead of F 2 , and other algebras (ClifFord algebras) 
instead of the complex field C. 
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Abstract This paper present a new approach to the Color Fourier Transformation. 

Color image processing is investigated in this paper using an algebraic 
approach based on triplet (color) numbers. In the algebraic approach, 
each image color pixel is considered not as a 3D vector, but as a triplet 
(color) number. The so-called orthounitary transforms are introduced 
and used for color image processing. These transforms are similar to 
a fast orthogonal and unitary transforms. Simulations using the color 
Wavelet-Haar-Prometheus transforms on color image compression have 
also been performed. 



Keywords: Clifford algebra, color images, color wavelet, edge detection, orthouni- 
tary transforms. 

I. Introduction 

Fourier analysis based on orthogonal and unitary transforms plays 
an important role in digital image processing. Transforms, notably the 
classical discrete Fourier transform, are extensively used in digital image 
filtering and in power spectrum estimation. Other Fourier transforms — 
e.g., the discrete cosine/sine transforms, wavelet transforms — are often 
employed in digital image compression. All the above-mentioned trans- 
forms are used in digital grey-level image processing. However, in re- 
cent years an increasing interest in color processing has been observed. 
Our approach to color image processing is in using so-called color triplet 
numbers [l]-[7] for color images and to operate directly on three-channel 
(RGB-valued) images as on single-channel triplet-valued images. In the 
classical approach every color is associated to a point of the 3D color 
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RGB vector space. In our approach, each image color pixel is considered 
not as a 3D RGB vector, but as a triplet (color) number. 

A natural question that arises in our approach is the definition of 
color (RGB-channel) transforms that can be used efficiently in edge 
detection and digital image compression. The so-called orthounitary 
(triplet-valued or color-valued) Fourier transforms are introduced and 
are used for color image processing. These transforms are similar to fast 
orthogonal and unitary transforms. Therefore, fast algorithms for their 
computation can be easily constructed. Simulations of application of 
color transforms to color image compression have also been performed. 
The main contributions of this paper are: a) the definition and analysis 
of properties of the orthounitary (color) Fourier transforms (in partic- 
ular, color Wavelet-Haar-Prometheus transforms); b) showing that the 
triplet (color) algebra can be used to solve color image processing prob- 
lems in a natural and effective manner. 

2. Color images 

The aim of this section is to present algebraic models of the subjective 
perceptual color space. The color representation we are using is based 
on Young’s theory (1802), asserting that any color can be visually repro- 
duced by a proper combination of three colors, referred to as primary 
colors. The color image appears on the retina as a 3D vector-valued 
((R, G, R)-valued) function 

fcoz(x) = (^/ij(x), /g(x), /b(x)^ = /ij(x)iij -F /c(x)iG + /s(x)iB, 

where /r(x) = A)Hji(A)dA, /g(x) = A)HG(A)dA, and 

/s(x) = J^s^^^{x,A)HB{A)dA, s°^^{x,A) is the color spectrum received 
from the object, Hb{A), Hb{A), Hb{A) are three photoreceptor (cone 
or sensor) sensitivity functions, A is the wavelength and i/j := (1,0,0), 
iG := (0,1,0), i_B := (0,0,1). 

In our approach, each image color pixel is considered not as a 3D 
RGB vector, but as a triplet number in the following two forms (see 
[1] in this book): fcoz(x) = /ij(x)lcoZ + /g(x)£coZ + /s(x)e^^;, fcoz(x) = 
fiu{^)^iu + fGZi(x)EG/i, where = 1. The first and the second expres- 
sions are called the AI 3 (R| 1, £coZ, e^z)" Al 3 (R, C)- representations of 
eolor image, respectively. Numbers of the form C = x A- yScoi + ^£coZ 
are called the triplet, 3-eyele, or eolor numbers. Every color num- 
ber C = X A ye^ -|- ze^ is a linear combination C = x A ys^ -|- ze'^ = 
aiueiuAzch^Ch = {aiu,zch) of the “scalar” aiuBiu and “complex” parts 
zch^Ch in the idempotent basis {e/„,EG/i}. Real numbers aiu G R we 
will call intensity (luminance) numbers, and complex numbers zch = 
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b + jc £ C are called the chromaticity numbers. Thus /(^(x) is a 
real-valued (grey-level) image and fch is a complex-valued (chromatic- 
valued) image. 

A 2 D discrete color image can be defined as a 2 D array fcoi '■= 
[^coi{h j)\fj=i i-e., as a 2 D discrete -valued function in one of the 
following forms fcoz(bj) : fcoz(bj) : Z^ — > 

^|°^(R, C), (i,j) G Z^. Here, every color pixel fcoz(^,j) at position 

(i,j) is a color number of the type 

fcoz(b j) := + /g(b + /Z)(b j)e^ 

or of the type 



fcoz(b j) := fiu{i,j)eiu + /czi(bi)Eczi- 



The set of all such images forms A^^D Greaves-Hilbert space L(Z 



Af) 



R 






col\N'^ 
3 ) 



— ("^3 



= R^"l + 






+ R^^e 2 



= R"^'ez„ + C 



AT 2 



Af> 



Eczi = 



where R^''*l, R'^ e^, 



t>N^ 



are real A^^D Hilbert spaces of 



red, green, and blue images, respectively, R'^ is the A^^D real space of 
gray-level images, and is the A^^D complex space of chromaticity 
images. 



A color linear operator £20 ■ ^ ’ ^2d[^coi] = 

Fcoz is said to be orthounitary if ~ ^2D- Orthounitary opera- 
tors preserve scalar product (.|.) and form orthounitary group trans- 
forms OU(^3°^). This group is isomorphic to the direct sum of orthog- 
onal and unitary groups 0 (M)e/„ -|- U(C)Ec/j and every element has 
the representation £20 = 02 D^Iu + hl 2 D^ch-, where O2D £ 0 (R) and 
hl2D G U(C) are orthogonal and unitary transforms, respectively. For 
color image processing we shall use separable 2 D transforms. The or- 
thounitary transform £2D[fcoz] = FcoZ is called separable if it can be 
represented as FcoZ = £2D[fcoz] = ^id[^coi\^2D , i-e. £21) = ^\D ® ^iD 
is the tensor product of two ID orthounitary transforms of the form 
S^2D = ^ID ® -CiD = (Oi (g) 02)e;„ {Ui (g ^2)Ec/i, where 0i,02 and 
lA\^li2 are ID orthogonal and unitary transforms, respectively. Thus, 
we can obtain any orthounitary transform, using any two pairs of or- 
thogonal 0i,02 and unitary transforms ^i,^2- In this work we shall 
use one pair of orthogonal and unitary transforms where Oi = O2 = 
O and IA\ = IA2 = U. In this case we obtain a wide family of or- 
thounitary transforms of the form £21? = (O (g) 0 )e;„ + iU ® U)^ch 
using different ID orthogonal transforms. In this work we shall use 
the more simple orthounitary transforms £ = O • -|- U • ^ch with 

U = O • diag(zo, 2^1, . . . , 2 ;at-i), where diag(zo, 2^1, . . . , zat-i) is a di- 
agonal matrix of chromatic (complex) numbers. In this case we have 
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£ = 0-e/„ + 0-diag(zo,zi , . . . , ZN-i)-'Ech = O -diag(Co, Ci, . . . ,Cn-i), 
where ■= + Zk^ch, k = 0 , 1 , . . . , N — 1 , are color (triplet) num- 

bers. Hence, our orthounitary transforms are represented as the product 
of an orthogonal transform O and a triplet-valued (color) diagonal ma- 
trix diag(Co,Ci, . . . ,Ctv-i)- A large number of orthounitary transforms 
can be devised by appropriate choice of the orthogonal transform O and 
the diagonal transform parameters Cq,Ci, . . . ,Cn-i- 



3. Color Wavelet-Haar-Prometheus transforms 

One of the aims of this paper is to define three-channel transforms 
(the so-called color Fourier transforms) that could eventually be used 
in digital color image compression. We have experimented with the 
so-called color Wavelet-Haar WH2", color Wavelet-Haar-Prometheus 
WHP2" , Wavelet-Haar- Hartley \VHH3n , and Wavelet-Haar-Hartley- 
Prometheus \VHHP 3 n, transforms [8]-[9]. 

Orthogonal and unitary Wavelet-Haar and Wavelet-Haar-Prometheus 
transforms have the factorizations: WHP2" = WH2"A2n, WTCP2" = 
W?f2"A2n, and 



WH2n = 



i=l 



(I 



2^n — i \ 



L2n 



( 1 ) 



WW2" 

respectively, where 



n 

2=1 *- 






( 2 ) 




c 



12'n.— (S' [1 1] 

— i (S' 1 1 




c 



<8) [1 CU3] 

I2TT- — i ^ 1 ^3 



Fo = 



1 1 
1 -1 



and T2 = 



products of the identity matrix l2n-i 


1 (J 3 


-V2 


1 1 




1 


1 -UJ3 _ 


2 


1 -1 




W3 _ 



re- 



spectively. Here, c = F2 is the classical Walsh transform and T2 
is the complex ws-deformed Walsh transform, where 
and A2" is a diagonal matrix, whose diagonal elements form the Shapiro 
(±l)-sequence. If a = (cci, 02, • • • , «n) is the binary representation of 
the number of the ath row of A2", where a* G Z2, then for diago- 
nal elements A^^q we have the expression Aq^q = (— l)^i=i The 

quantity 6(a) = YllZi otiCti+i is the number of occurrences of the digi- 
tal block B = ( 11 ) in the binary representation (ai,a2, . . . ,an) of the 
number a. 
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Using two pairs (WH2", VV’'H2") and (WHP2», VV’H'P2") of fast or- 
thogonal and unitary Haar- Wavelet transforms (1-2), we construct ID 
color Wavelet-Haar and Wavelet-Haar-Prometheus 2”^-point fast trans- 
forms as 



— WH2 " W’?Y2" •E(7/i 



n 

\i=l 



(I2”-- ©I 2n 



( 3 ) 



^ 2 " — f AVHP 2" ' *5^u + V^VP'2^ ' 2^2" — 



n 



(^2^- i© ^2)©I 2n 



A9 



\i=i ■- 

where 5^2 = F2 • -h J^2 • Ech = 



2 



1 1 
1 -1 



• OZii + 



1 U3 
i -W3 




2 




( 4 ) 



We see that the color Haar- Wavelet transform has the same fast algo- 
rithm as the orthogonal and unitary transforms in Eqs. (1). Note that 
the product of e with a color pixel fcoz = (//?, /g,/b) = /ijl+ZGe^+Z^e^ 
is realized without multiplications as the right shift of color components 
efcoz = £(/r, /g, Ib) = + /g£^ + fse^) = (/si + /r£^ + fse^) = 

(/S) fn, /g)- In this case the situation is the same as for Number Theo- 
retical Transforms. 

The next example of color (orthounitary) Haar-like wavelet transforms 
is based on Haar-Hartley H3 (3 x 3)-transforms. Using these transforms 
we construct an “elementary” three-point color transform of the follow- 
ing form in the ^3(R-, C)-algebra: .^5^3 = H3 • -h • Bch = 



1 

71 



1 1 1 

1 hi /i2 

1 /z-2 hi 



^lu + 



73 



1 1 1 
1 hiOj\ h20J3 
1 / 12 W 3 



Ecfc = 



1 

71 



■ 1 


1 


1 


1 

■71 


■ 1 
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1 




■ 1 




1 


hie^ 


h2S^ 


1 


hi 


h2 






el 


1 


h2E^ 


hi£^ 


1 


h2 


hi _ 






e2 _ 



( 5 ) 



where hi := cas (^) = cos (^)-l-sin (^) , /12 := cas (^) = cos (^) + 
sin (^) and ozg := cis (^) = cos (^) -|- isin (^) , := cis (^) = 

cos (^) +i sin (^) • We use the orthogonal and unitary Wavelet-Haar- 
Hartley and Wavelet-Haar-Hartley-Prometheus 3"^-point transforms that 
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follow: AVHP 3 n — AVH 3 n.A 3 n, yV'ht'P^n — yV'ht^'Ti ^ and 



WH 3 ™ = 



2=1 



(1371- — F3) ® l 3 ri_ 3 ri — i + 1 



WW 3 " = n 

respectively, where 



i=i 



(1372 — i (S' * 2 ^ 3 ) tD l3n_27^ — 7 + 1 



( 6 ) 

(7) 



0 


l3n-i(g)[l 1 1 ] 


0 


I377— U -^3 U -^3 I 


1371. — P3 ~ 


1377.— h\ h2 ] 


, l3n-i(8> = 


I377— ^1^3 ^ 2^3 ] 




1377— h\ ] 




l3n-i(8>[l h2U!^ hlU!^] 



are the generalized tensor products of the identity matrix l 3 n-i with 






1 1 1 
1 hi h2 
1 /i2 hi 



and Hs = 

v/3 



1 1 1 

1 hiOJ^ /l2i^3 

1 /l2<^3 /iiw| 



respectively, and A^n is a diagonal matrix whose diagonal elements form 
the 3-point Shapiro ws-sequence. If a = (ai, a 2 , • • • , an) is the 3-ary 
representation of the number in the ath row of where a* G Z 2 , then 

for diagonal elements we have the expression A„^q = 

Using these fast orthogonal and unitary Wavelet-Haar-Hartley-Pro- 
metheus transforms (6-7) we construct ID fast color Wavelet-Haar-Hart- 
ley-Prometheus 3^-point transforms by = 2 U^ 3 nA 3 n and 



(8) 



21113371 — I ]3 (l 3 ™-i( 8 > -135^3) © l 3 "_ 37 i-i + l 

Vi=l . 

where 13^3 is the color 3-point Hartley transform (5). 



4. Edge detection and compression of color 
images 

One of the primary applications of this work could be in edge detection 
and color image compression. For edge detection, we convolve the color 
(3 X 3)-masks mcoi{i,j) with a color image fcoiiuJ) of size N x N : 

f{hj)= ^ rncoi{k,l%oi{i - k,j - 1). 

We use color Prewitt ’s-like masks for detection of horizontal, vertical, 
and diagonal edges. As entries instead of real numbers these masks have 
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Figure 1. Color edge detector. Left: original image, right: detected edges. 



triplet numbers: 
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The effect of the masks in homogenous and nonhomogenous color regions 
differs substantially. Let us analyze both cases in detail. At any position 
(i, j) of a homogenous color region after convolution we get 



fcoZ = 






IR 

ell 



m+fF+fF 
fF + fF + fF 



f31 

JR 

fSl 

JG 

fSl 

JB 



+ /f 
+ /f 
+ /f 



+ /f 
+ /f 
+ /f 



= 0 , 



since, for all 9 pixels we have = fi3 = f2^i = f 22 ^ f23 ^ 

Fol ~ Fol ~ Fol ~ color const = C. For horizontal nonhomogenous 
color regions we have = Ci = aj^ei^ + bfzlf^Ech, and 



f31 _ c32 _ 
^col ^col 



= C2 



fii 

Jr 

fii 
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fll 

JB 



+ fF 
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+ fF 
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afu^iu + '^Ih^ch- Hence, fcoZ = 
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[<\ 



Fig. 1 shows the result of color edge detection. 

We have performed a number of simulations on the use of orthouni- 
tary transforms in the area of color image compression. We have exper- 
imented both with 2” x 2^ and 3” x 3”-pixel images by using 2BidiP2": 
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and WHP2", \VHP3n transforms. Figures 2-3 illustrate the 
and WHPs^, WHPgn transforms of the color image “BA- 
BOON” after the first iterations of the fast algorithms of these transfor- 
mations. Examples of 2D color histrograms of chromaticity planes for 
different spectrums are shown in Fig. 4. 




Figure 2. Left: original “BABOON” image, right: wavelet spectrum of “BABOON” 
after the first iterations of . 



5. Conclusion 

A system of color-valued 2D basis functions has been defined in this 
paper. This system can be used to obtain color orthounitary Fourier 
transforms and series analyses of color images. Properties of the color 
Fourier transforms are presented. It is shown that such color series have 
properties similar to the classical orthogonal Fourier series. A family 
of discrete color orthounitary 2D Fourier transforms has also been pre- 
sented that can be used in color image compression. In particular, the 
color Wavelet-Haar-Prometheus transforms are defined and used to ob- 
tain the color Wavelet-Haar-Prometheus series. 

The analysis presented in this paper provides a very general frame- 
work for the definition of other multicolor transforms based on multiplet 
hypercomplex numbers. The derivation of such multicolor transforms 
is the subject of ongoing research. The motivation of this ongoing re- 
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Figure 3. Wavelet spectrum of “BABOON” after the first iterations. Left: WH2", 
right: 2Ui53»- 




Figure 4- Examples of 2D histograms of the chromaticity plane. From top to bottom 
from left to right: original “BABOON” image, WH3 spectrum, WSjs spectrum, 
spectrum. 



search is to define multicolor transforms that can be used efficiently in 
multicolor satellite image compression. 
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1. Transformations of Euclidean Space and 
Clifford Geometric Algebra 

Georgi H. Georgiev 

Shumen University 
Shumen 9712, Bulgaria 

g.georgiev@shu-bg.net 

I. 1 Similarity transformations 

Any direct similarity transformation of Euclidean affine space is a 
product of an orientation-preserving homothety and a proper rigid mo- 
tion. In other words any direct similarity transformation of preserves 
the orientation and the angles (see [1] for details). Let Sim~^{£^) denote 
the group of all direct similarities of 

If we identify the Euclidean plane £‘^ with the field of complex numbers 
C , then / G Sim^{£‘^) has a representation 

f{z)=az + (3, zeC = S‘^, 

where a € C\{0} and /3 G C are constant. 

In the case n = 3, we may identify three-dimensional Euclidean space 
with the space of pure quaternions Im H. Then an arbitrary similarity 
/ G Sim~^(£^) has a representation 

/(z ) = A n z + a, z G ImH = 

where A G M"*", n is a fixed unit quaternion and a a fixed pure quaternion. 
Clearly, / is a homothety whenever A 7 ^ 1 and n = ±1, / is a translation 
whenever A = 1 and n = ±1, / is a rotation whenever A = 1 and a = 0. 

The next step is the discovery of a common representation of the 
similarities by one equation in any dimension. 

Problem 1. Using Clifford geometric algebra find a representation 
of / G Sirn^{£'^) for any n > 1 in terms of the geometric product. 
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Note that the dilations, rotations and translations can be represent 
by rotors (see [3]). 

1.2 Quadratic transformations 

Let Q be a non-degenerate quadric hypersurface in the real projective 
space P”, n > 1, let p be a point lying on Q, and let r be the tangent 
hyperplane to Q at p. Then the pair {Q, p) determines a quadratic 
involutory transformation 

If : P”\r — >P’"\r 

with the set of fixed points Q\ p. In fact for x G P”\r, 
if{x) = X = Po1q(x) n (px) , 

where Po1q(x) is the polar hyperplane of x and (px) is the straight line 
passing through p and x. If P ” is the projective extension of and r is 
the hyperplane at infinity, then p is a one-to-one transformation of T”. 
The three-dimensional Clifford algebra ^3 and techniques from [4] and 
[5] are applied in [2] for a description of the case n = 2. 

If Q=p + \q + \‘^r (AgM) is a conic in P where p ^ Q, 
q and r are non-collinear points in the projective plane, the line L = 
p A X = (px) passes through p and x, and meets the conic Q at a second 
point x° (see [2]). 

Then cp has the equation 

X =2p ■ X [x^ ■ X — X ■ x) p + [p ■ X p ■ x^ — p ■ p x^ ■ x) X , 

where the second intersection point x° = [rpx] ^p — [qpx] [rpx] q -|- 
[qpx] ^r, [p q r] = {p A q A r) , and I is the unit pseudoscalar in 

Gs- 

Problem 2. Using Clifford geometric algebra obtain an explicit equa- 
tion for p for n >3 . 

The techniques stated in [6] can be used for the solution of the above 
problems. 
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2. On the Distribution of Kloosterman Sums on 
Polynomials over Quaternions 
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Abstract 

We formulate two problems concerning the distribution of angles of 
Kloosterman sums and the algebraic structure of curves over quater- 
nions. 

Keywords: density of a distribution, elliptic curve, Kloosterman sum, quaternions. 

2.1 The Problem of the distribution of 
Kloosterman sums 



Let 

cx+d 

X=1 

1 < c, d < p — 1; X, c, d G F* 

be the Kloosterman sum. 

By the A. Weil [1] estimate 

Tp{c,d) = 2 y/p cos 9p{c,d), 
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0p{c, d) 



arccos( 



Tp{c, d) 
2VP 



on the interval [0 , tt) : 

Problem 1. Let c and d he fixed and p vary over all primes not 
dividing c and d. What is the distribution of angles 9p{c, d) as p ^ oo 7 

Motivation. Kloosterman sums of different types have in recent 
years played an increasingly important role in the study of many prob- 
lems in analytic number theory [2]. 

During 1983 and 1989 the presenter of this problem implemented compu- 
tations with Kloosterman sums Tp(c, d) for primes p, 2 < p < 13499, c = 
d = 1, and for l<c<p — 1, 1 < d < p — 1, for some primes p [3, 4]. 

Conjecture. Under the eonditions of Problem 1 the angles 6p{c, d) 
are distributed on [0, tt) with Sato- Tate density ^ sin^ t. 

2.2 On algebraic structure of curves over 
quaternions 

Let E be the elliptic curve over a field K, E{K) the group of rational 
points of E. E{k) is an abelian group with a finite number of generators 
(Poincare, Mordell, Weil). 

Let End{E) = Hom{E,E) be the ring of endomorphisms of E. In some 
cases the ring End{E) is noncommutative (Deuring). 

Example. Let KT = F22 and E be defined by the equation 

y"^ + y = x^. 

Let a{x, y) = {sx, y) where e^-|-e-|-l, eGF22 and Frobenius 4>{x, y) = 
Then, since 7^ e, a o (p(x,y) 4> o a{x,y). Moreover the 

ring End{E) is isomorphic to the set of integer quaternions End{E) = 
{mi -|- ni 2 i + msj + m^k, m* G Z or m^ -|- 1/2 G Z}, so F is a supersin- 
gular elliptic curve. 

Let H = joi -|- 02i -|- asj -|- a^k, a* G R} be the ring of quaternions 
and 



EH : y'^ = x^ -\- ax -\- h 



be the equation with a, 6 G H. 
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Problem 2. What can be said about the algebraic properties and 
structure of the set 

EH{H) = {{x, y) eHx-Uky"^ = x^ + ax + b} 
and the set Mor{EH{H)) of maps EH{U) EH(H) ? 
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vladimir_krakovsky@uasoiro.org.ua 

Harmonic sliding analysis (HSA) is a dynamic spectrum analysis [1] 
in which the next analysis interval differs from the previous one by in- 
cluding the next signal sample and excluding the first one from the 
previous analysis interval. Such a harmonic analysis is necessary for 
time- frequency localization [2] of the analysed signal given peculiarities. 
Using the well-known Fast Fourier transform (FFT) is not effective in 
this context. More effective are known recursive algorithms which use 
only one complex multiplication for computing one harmonic during each 
analysis interval. The presenter of this problem improved one of those 
algorithms, so that it became possible to use one complex multiplication 
for computing two, four and even eight (for complex signals) harmon- 
ics simultaneously [3], [4], [5]. One problem of HSA was mentioned in 
[6]. In [7] there is a short review of the presenter’s papers devoted to 
HSA. A basic problem of HSA is: are there methods to further increase 
the speed of the response using up-to-date nanotechnology? If this is 
possible, how does one do it? 
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4. Spectral Analysis under Conditions of 
Uncertainty 

Sergey I. Kumkov 
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In the classic spectral analysis, a sample {xn-,tn-, n = 0,A^ — 1} of 
the discrete time periodic process is the value of the process, tn 
is the argument; N is the sample length) is expanded in the spectrum 
C = [Ck = Xn^k{tn ))/N, k = 0, N - 1} by using some basis 

system {4>k{tn), k = 0, N — 1] n = 1,A1 — 1}. The reverse representation 
of the process is {xn{C) = "Ylik^o Ck<t)k{tn)}^ respectively. 

Usually the sample is composed of the disturbed values with some 
model, for example, the model of disturbance can have the form Xn = 
x*+6n,n = 0,A^ — 1, where x* is the unknown true value of the process, 
£ = {^n} is the additive disturbance. 

If the disturbance has a stochastic nature and is described by some 
probability characteristics, there are classical approaches for description 
and computation of the spectrum for this stochastic process. 

But in practice, often the statistical properties of disturbance are 
unknown, the researcher can show only the geometric constraint je^l < 
£maxi and the sample gives the collection of the uneertainty sets {Hn = 
[xn — £max,Xn + £max]}- Additionally, the type of the process can be 
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given, i.e., x*{t) = f{t,P), where /(•) is the describing function, P is 
the vector of parameters of dimension M, and M < N. 

Now, each component Ck mentioned above becomes a set Ck = Ck + 
iJ 2 n =0 £n^k{tn)) / N , where all £„ G [-£max,£max], and the process spec- 
trum C becomes the body in the space generated by the taken basis {(pk}- 
The researcher is interested only in the vectors C G C consistent with 
the shown collection of the uncertainty sets, i.e., Xn{C € C) € Hn for all 
n = 0, — 1. 

Problem. Since the sets Ck depend on the vector of the disturbance 
£, they are not mutually independent and the spectrum C is not a rect- 
angular parallellotop. How to describe and to compute the frontier of 
the body-spectrum C consistent with the collection of the uncertainty sets 
and the given type of the descriptive function? 

5. A Canonical Basis for Maximal Tori of the 

Reductive Centralizer of a Nilpotent Element 

Alfred G. Noel 

Mathematics Department 

The University of Massachusetts 

Boston, Massachusetts 02125-3393, USA 

anoel@math.umb.edu 



Abstract We describe a problem that we partially solved in [1], [2] and [3] using 
a computational scheme. However a more conceptual argument might 
be very enlightening 



5.1 Problem Description. 

Let g be a real semisimple Lie algebra with adjoint group G and g^. 
its complexification. Also let g = 6 0 p be the Cartan decomposition of 
g. Finally, let 9 be the corresponding Cartan involution of g and a the 
conjugation of g,, with regard to g . Then g^. = 6... 0 p,j^. where L, and p^. 
are obtained bycomplexifying 6 and p respectively. Denote by /y,. C G,.. 
the connected subgroup of the adjoint group of g,.. with Lie algebra 

e,. 

A triple {j;,e, f ) in g,. is called a standard triple if [x, e] = 2e, [x, /] = 
-2/ and [e, f] = x. If x G 6.,, e and / € p^, then (x.e.f ) is said to be 
normal. It is a result of Kostant and Rallis that any nilpotent e of p,,. 
can be embedded in a standard normal triple {x,e,f ). Let be the 

centralizer of (x.e, /) in t,,,. 
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Maintaining the above notations we would like to solve the following 
problem: 

5.2 

PROBLEM: Let t be a Cartan subalgebra of such that x G t. Then 
find two nilpotent elements e and / in such that (x, e, /) is a standard 
triple and ti = t n ig g, maximal torus in More precisely 

give a natural basis for ti . 

The reader should be aware that in general ti 7 ^ for an arbi- 

trary e. A counterexample can be found in [1], Furthermore, there is 
currently no good characterization of such a torus in the literature. Our 
conversation with several experts led us to believe that such a character- 
ization may be quite technical. We wish to thank Prof. Andreas Dress 
from Bielefeld University for simplifying the statement of the problem. 

5.3 Partial Results. 

When 0 is a real form of an exceptional non-compact simple complex 
Lie algebra we have developed a computational scheme to compute ti. 
These results are found in [1], [2], and [3] . Here is a concise version of 
the algorithm: 

Algorithm 

Input: 0 is a real exceptional simple Lie algebra, = {/?i, . . . , a set 
of fundamental roots of 6 (., and t is a Cartan subalgebra of 0 ^ define by 
Afe. 

Computation 

■ Compute X using the values of fli{x). 

r 

■ Using X, express e = ^ c.y.A.y. , where 7* is a non compact root. 

i=l 

A.y. a non zero root vector and fo. a complex number, in a regular 
semisimple subalgebra of minimal rank r. Create the normal 
triple (x,e,/). 

■ Compute the intersection of the kernels of 7 * on t. Observe that 
the eomplex span of sueh an interseetion is a maximal torus in 
gU,e,/) 

Output: ti is the complex span of the intersection computed above. 

We do not have at this moment a good way of describing a natural 
basis for ti in general. The above choice of e suggests that e must 
be given as a linear combination of a minimal set of root vectors. In 
designing the above algorithm we used some information about the type 
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of from the work of Dragomir Djokovic. We anticipate that we 

will need such information for the classical groups where the nilpotent 
orbits are parametrized by partitions. 
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6. The Quantum Chaos Conjecture 
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According to the general formulation of the quantum chaos conjecture 
introduced in [1] (see also [2]), the distribution of the distances between 
adjacent energy levels of a quantum system should be close to a Poisson 
distribution with density e~^ provided such a system is a quantum ana- 
logue of a classical integrable system. The quantum chaos conjecture 
has not been proved previously for any system. The quantum system is 
described by the Schrodinger equation and obtained from the classical 
system with the help of some standard procedures. We present a class of 
quantum systems containing as a special case the well-known and pop- 
ular model of a rotating particle subject to (5-shocks (“kicked rotator”), 
which in many papers has been considered as one of the main objects in 
connection with confirming the conjecture. 

The proof of the quantum chaos conjecture for this model makes essen- 
tial use of results on the distribution of distances between adjacent frac- 
tional parts of the values of a polynomial, and the estimate of the remain- 
der is based on a new theory of generalized continued fractions for num- 
ber vectors ([3] -[6]). We consider a one-dimensional nonlinear oscillator 
determined by the Hamilton function H = H{4>, I, t) = Hq{I) + t): 

dtp _dH _ dHo dl _ dH _ dHi 

dt dl dl ’ dt d(j) d(p ’ 



( 1 ) 
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where I, (j) are “action-angle” variables, t is an independent variable 
and the function has period 27 t with respect to (p and period 

T > 0 with respect to t and can be represented as 

OO 

= F{P) Y, Kt-kT), (2) 

k=—oo 

with F((p) a smooth 27r-periodic function and S = 5{t) the delta-function. 

We assume that Hq(I) = is a polynomial of degree n > 

2 with coefficients bg = ag/bP (s = where b is the Planck 

constant and the Ug are real numbers. In the special case when n = 2, 
oq = oi = 0, and F{(p) = 'jcosp (7 is a constant), the system (1) 
represents a “kicked rotator”. We introduce the Hilbert space of 
complex 27T-periodic functions as the state space of the quantum system 
and we define the momentum operator by / = f The evolution of 
the wave function 'k = \k(0, t) G is described by the Schrodinger 
equation 



where H{t) = Hq + Hq = is the limit 

as e ^ 0 (e > 0 ) of the operators of multiplication by the function 
obtained from the function Hi in ( 2 ) when the delta-function S is 
replaced by a smooth positive function supported on the interval [ 0 , e] 
and having integral 1 . 
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7. Four Problems in Radar 

Michael C. Wicks and Braham Himed 
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For more context on these four problems, please consult the chapter 
in this book by the same authors, from which this section is excerpted. 

As analog hardware performance matures to a steady plateau, and 
Moore’s Law provides for a predictable improvement in throughput and 
memory, it is only the advances in signal and data processing algorithms 
that offer potential for performance improvements in fielded sensor sys- 
tems. However, it requires a revolution in system design and signal 
processing algorithms to dramatically alter the traditional architectures 
and concepts of operation. One important aspect of our current research 
emphasizes new and innovative sensors that are electrically small (on the 
order of 10 wavelengths or less), and operate in concert with a number of 
other electrically small sensor systems within a wide field of view (FOV) . 
Our objective is to distribute the power and the aperture of the conven- 
tional wide area surveillance radar among a number of widely disbursed 
assets throughout the battlefield environment. Of course, we must have 
an algorithm for distributing those assets in real time as the dynamically 
changing surveillance demands. The mathematical challenge here relates 
to the traveling salesman problem. Classically, the traveling salesman 
must select his route judiciously in order to maximize potential sales. 
Recent analysis in the literature addresses multiples salesmen covering 
the same territory. This is analogous to our problem, where multiple 
unmanned aerial vehicle (UAV) D based sensors are charged with the 
mission of detecting, tracking, and identifying all targets (friend or foe). 
Not only must these sensors detect and identify threat targets, they must 
also process data coherently across multiple platforms. Our mathemat- 
ical challenge problem reduces to one in which the position and velocity 
all UAV-based sensors are selected to maximize detection performance 
and coverage area, and minimize revisit rate. 

Enhancing one of the sensors described above, to be more like a clas- 
sical radar with a large power-aperture product, leads to the second 
mathematical challenge problem to be addressed by this community. 
With a larger aperture and more precise estimates of target parame- 
ters (angle, Doppler), an opportunity to expand the hypothesis testing 
problem to include both detection and estimation emerges. Here, con- 
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ventional wisdom dictates that we perform filtering and false alarm rate 
control as part of the detection process, yet perform track processing 
as a post-detection analysis, where the parameter estimation is focused 
upon target position and velocity history. Clearly, parameter estimation 
need not be accomplished as a post-detection process. Since this seg- 
mented approach to detection and track processing has been in effect for 
decades, it will require a dramatic demonstration of improvement before 
it will be embraced by the radar community. 

A third challenge problem arises in the formulation of the Generalized 
Likelihood Ratio Test (GLRT). In Kelly’s formulation of a GLRT, con- 
ditioning on finite sample support is incorporated into the basic test. As 
such, a statistical method developed under the assumption that only fi- 
nite training data are available for sample covariance matrix formulation 
was made available. The next generalization to be made, in an extension 
of Kelly’s GLRT, is to incorporate prior knowledge of the structure of 
the sample covariance matrix into the mathematical development of a 
statistical test. This mathematical structure arises due to the fact that 
the phase spectra of ground clutter as seen by an airborne radar is de- 
termined only by geometry, and remains independent of the underlying 
clutter statistics (except for initial phase). The effect of this geometric 
dependence is to localize the interference along a contour in the trans- 
form domain (Fourier analysis). Our objective is to formulate a single 
GLRT which incorporates the effects of finite training data as well as 
geometric dependence. 

The fourth mathematical challenge facing the modern radar engineer 
is to incorporate adaptivity on transmit into the basic formulation of 
the signal processing algorithm. Since this is a new research topic, an 
opportunity exists to formulate the basic mathematical framework for 
fully adaptive radar on both transmit and receive. Further extensions 
arise by incorporating the above challenge problems into this analysis. 
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